TWO-STEP ESTIMATION OF ERGODIC LÉVY DRIVEN SDE


HIROKI MASUDA AND YUMA UEHARA 


Abstract. We consider high frequency samples from an ergodic Lévy driven stochastic differential equation (SDE) with drift coefficient $a(x,\alpha)$ and scale coefficient $c(x,\gamma)$ involving unknown parameters $\alpha$ and $\gamma$. We suppose that the Lévy measure $\nu_0$ has moments of all orders but is not fully specified. We prove the joint asymptotic normality of some estimators of $\alpha$, $\gamma$ and a class of functional parameters $\int\varphi(z)\nu_0(dz)$, which are constructed in a two-step manner: first, we use the Gaussian quasi-likelihood for estimation of $(\alpha,\gamma)$; and then, for estimating $\int\varphi(z)\nu_0(dz)$, we make use of the method of moments based on the Euler-type residual with the previously obtained quasi-likelihood estimator.


1. Introduction 

It is widely recognized that a diffusion model is a typical candidate for describing high-activity time-varying dynamics. However, especially in biological, technological and financial applications, there exist many phenomena where the driving noise process exhibits highly non-Gaussian behavior. A jump-type Lévy process may serve as a suitable building block in modeling such phenomena. In this paper, we consider high frequency data $(X_{t_0^n}, X_{t_1^n}, \dots, X_{t_n^n})$ from the one-dimensional Lévy driven stochastic differential equation (SDE):

$$dX_t = a(X_t,\alpha)dt + c(X_{t-},\gamma)dJ_t, \quad X_0 = x_0, \qquad (1.1)$$

where: 

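• $\alpha = (\alpha_i)$ and $\gamma = (\gamma_j)$ are unknown finite dimensional parameters and we suppose that each of them is an element of a bounded convex domain, $\Theta_\alpha \subset \mathbb{R}^{p_\alpha}$ and $\Theta_\gamma \subset \mathbb{R}^{p_\gamma}$ respectively; we write $\Theta = \Theta_\alpha \times \Theta_\gamma$ and $p_\alpha + p_\gamma = p$.

• The functional forms of the drift coefficient $a : \mathbb{R}\times\Theta_\alpha \to \mathbb{R}$ and the scale coefficient $c : \mathbb{R}\times\Theta_\gamma \to \mathbb{R}$ are known.

• $J_t$ is a one-dimensional pure jump Lévy process with Lévy measure $\nu_0$.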

We denote by $P_0$ the true image measure of $X$ associated with the true value $\theta_0 \in \Theta$. Note that we do not consider the case of misspecification of the functional form of the coefficients. We suppose that the path of $X_t$ is not observed continuously but observed discretely at high frequency: we consider the samples $(X_{t_0^n}, X_{t_1^n}, \dots, X_{t_n^n})$, where $t_j = t_j^n = jh_n$ for some $h_n > 0$ which satisfies that

—>■ 0 and —>• 00 , 

for $n \to \infty$ and some $\epsilon_0 \in (0,1)$. The objective of this paper is to estimate $\theta_0$ and the functional parameter $\int\varphi(z)\nu_0(dz)$ for some function $\varphi$ in a two-step manner. It is not essential in our results that $X$ has no Wiener part, but the absence is assumed from the very beginning just for simplicity of the statements; see Remark 3.8 for a brief discussion.

Up to the present, many results about the estimation of diffusion processes (corresponding to the case where $J_t$ in (1.1) is replaced with a standard Wiener process) have been established in both the continuous sampling case and the discrete sampling case. In the continuous sampling case, the explicit form of the likelihood is available (see, for example, [13]). Hence we can construct the maximum likelihood estimator of $\alpha$, and under some conditions it is consistent and asymptotically normal (for details, see [12] and [17]). In the discrete sampling case, we cannot obtain a closed form of the likelihood in general, so that we have to consider another method. Typically, we resort to the quasi-likelihood based on the local Gaussian approximation. By the Itô-Taylor expansion, [10] gives the estimation scheme in the
case of nhn —>■ 00 and —>■ 0 (Vg > 2). [ 8 ] shows its local asymptotic normality; he also shows the 

local asymptotic normality in the non-ergodic case. Needless to say, there are many estimation methods besides the (quasi) maximum likelihood method (see, for example, [12] and [17]). We emphasize that these estimation methods essentially rely on the scaling and finite-moment properties of the Wiener process.

Construction of an estimator of $\int\varphi(z)\nu_0(dz)$ is important in statistical inference associated with Lévy processes. Recall that the class of bounded continuous functions vanishing in a neighborhood of the origin completely characterizes $\nu_0$ [18, Theorem 8.7]. In particular, the parameter $\int\varphi(z)\nu_0(dz)$ corresponds to the $q$th cumulant of $J_1$ for $\varphi(z) = z^q$ with $q > 2$, and also to the cumulant transform of $J_1$ for $\varphi(z) = e^{iuz} - 1 - iuz$, $u \in \mathbb{R}$, which is important in assessing the ruin probability in a jump-type Lévy risk model. Examples of moment-fitting estimation of $\int\varphi(z)\nu_0(dz)$ from the discrete samples $(J_{h_n}, J_{2h_n}, \dots, J_{nh_n})$ are proposed in [6] and [19]. The main claim of [6] says that under some moment conditions, for a function $\varphi$ vanishing in a neighborhood of the origin it follows that

$$\sqrt{nh_n}\left(\frac{1}{nh_n}\sum_{j=1}^n \varphi(\Delta_j J) - \int\varphi(z)\nu_0(dz)\right) \xrightarrow{\mathcal{L}} N\left(0, \int\varphi(z)^2\nu_0(dz)\right), \qquad (1.2)$$

where $\Delta_j J = J_{jh_n} - J_{(j-1)h_n}$. However, in the estimation of a Lévy driven SDE we encounter the difficulty that $(J_{h_n}, J_{2h_n}, \dots, J_{nh_n})$ cannot be observed directly. One may think of utilizing a martingale estimating function for joint estimation of $\theta_0$ and $\int\varphi(z)\nu_0(dz)$. However, we then have to specify what kind of conditional expectation is to be used in an explicit way, which inevitably requires more specific structural assumptions about $\nu_0(dz)$ beyond Assumption 2.2.
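To make the moment-fitting statistic in (1.2) concrete, the following Python sketch applies it to simulated increments of a toy Lévy process. Everything specific here is an assumption chosen for illustration and is not taken from [6]: the process is compound Poisson with intensity `rate` and standard normal jumps, the function is $\varphi(z) = \cos(uz) - 1$ (which does not vanish near the origin, but is integrable for this finite Lévy measure), and the helper names are hypothetical.

```python
import math
import random


def compound_poisson_increment(rng, rate, h):
    """One increment over a time step h of a compound Poisson process
    with jump intensity `rate` and standard normal jump sizes."""
    total = 0.0
    t = rng.expovariate(rate)          # arrival time of the first jump
    while t < h:
        total += rng.gauss(0.0, 1.0)   # add one jump
        t += rng.expovariate(rate)     # waiting time until the next jump
    return total


def moment_fit(increments, h, phi):
    """Moment-fitting statistic (1 / (n h)) * sum_j phi(increment_j)."""
    n = len(increments)
    return sum(phi(x) for x in increments) / (n * h)


rng = random.Random(0)
rate, h, n, u = 2.0, 0.01, 100_000, 1.0    # illustrative values only
incs = [compound_poisson_increment(rng, rate, h) for _ in range(n)]

estimate = moment_fit(incs, h, lambda z: math.cos(u * z) - 1.0)
# For this toy model the Levy measure is rate * N(0, 1)(dz), hence
# int (cos(u z) - 1) nu_0(dz) = rate * (exp(-u^2 / 2) - 1).
target = rate * (math.exp(-u * u / 2.0) - 1.0)
print(estimate, target)
```

In the SDE setting the increments passed to `moment_fit` are not observable, which is exactly the difficulty described above.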

Here we will take another route. Previously, [15] used the Gaussian quasi-likelihood, which applies to a large class of Lévy processes, making it possible to construct the Gaussian quasi maximum likelihood estimator (GQMLE) $\hat\theta_n = (\hat\alpha_n, \hat\gamma_n)$ of the true value $\theta_0 = (\alpha_0, \gamma_0)$ without any specific information about the noise distribution; also, [15, Theorem 2.7] shows that it is consistent and asymptotically normal with rate $\sqrt{nh_n}$. By making use of the GQMLE and the functional-parameter moment fitting, we propose a two-step procedure for joint estimation of $\theta$ and $\int\varphi(z)\nu_0(dz)$: we first estimate $\alpha$ and $\gamma$ by the GQMLE, and next construct the estimator of $\int\varphi(z)\nu_0(dz)$ based on the Euler-Maruyama approximation. We still do not presume a closed form of the noise distribution, so that our way of estimation is beneficial in terms of robustness against noise misspecification. Further, the proposed two-step procedure enables us to bypass a simultaneous optimization problem, which may result in a high computational load.

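The organization of this paper is as follows. In Section 2, we introduce notations and assumptions for our main results. Section 3 provides our main results: the stochastic expansion

$$\sqrt{nh_n}\left(\frac{1}{nh_n}\sum_{j=1}^n\varphi(\hat\delta_j) - \int\varphi(z)\nu_0(dz)\right) = \sqrt{nh_n}\left(\frac{1}{nh_n}\sum_{j=1}^n\varphi(\Delta_j J) - \int\varphi(z)\nu_0(dz)\right) + \hat b_n\left[\sqrt{nh_n}(\hat\gamma_n - \gamma_0)\right] + o_p(1)$$

and the asymptotic normality of our estimators; see (3.2) for the explicit form of $\hat b_n$. In particular, the second term of the right-hand side reflects the effect of plugging the $\sqrt{nh_n}$-consistent estimator $\hat\gamma_n$ into the scale components of the Euler-residual sequence. All the proofs of our main results are presented in Section 5.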


2. Notations and Assumptions 

2.1. Notations. We denote by $(\Omega, \mathcal{F}, (\mathcal{F}_t)_{t\in\mathbb{R}_+}, P)$ a complete filtered probability space on which the process $X$ is defined, the initial variable $X_0$ being $\mathcal{F}_0$-measurable and $J_t$ being $\mathcal{F}_t$-adapted and independent of $X_0$.

For abbreviation, we introduce some notations. 

• $E_0[\cdot]$ denotes the expectation operator with respect to $P_0$ and we abbreviate $\int\varphi(z)\nu_0(dz)$ to $\nu_0(\varphi)$.

• For a differentiable function $f$, $\partial_x f$ stands for the derivative with respect to any variable $x$ and $\partial f$ represents the vector of the derivatives of the components of $f$.

• $t_j := jh_n$.

• $E^{j-1}[\cdot]$ stands for the conditional expectation with respect to $\mathcal{F}_{t_{j-1}}$.

• $\Delta_j Z$ stands for $Z_{t_j} - Z_{t_{j-1}}$ for any process $Z$.

• $\sum_j := \sum_{j=1}^n$ and $\int_j := \int_{t_{j-1}}^{t_j}$.

• rj{x,9) := a{x,a)c ^{x,'-f) and M{x,6) := daa{x,a)c ^(x,j). 





• $f_s := f(X_s, \theta_0)$ for any function $f$ on $\mathbb{R}\times\Theta$; e.g. $a_t(\alpha) = a(X_t, \alpha)$ and $M_t(\theta) = M(X_t, \theta)$.

• We will write $x_n \lesssim y_n$ when there exists a positive constant $C$ such that $x_n \le C y_n$ for large enough $n$; $C$ does not depend on $n$ and varies from line to line.

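We define the random functions $G_n^\alpha(\theta) \in \mathbb{R}^{p_\alpha}$ and $G_n^\gamma(\theta) \in \mathbb{R}^{p_\gamma}$ by

$$G_n^\alpha(\theta) = \frac{1}{nh_n}\sum_j \frac{\partial_\alpha a_{j-1}(\alpha)}{c_{j-1}^2(\gamma)}\left(\Delta_j X - h_n a_{j-1}(\alpha)\right),$$

$$G_n^\gamma(\theta) = \frac{1}{nh_n}\sum_j \frac{\partial_\gamma c_{j-1}^2(\gamma)}{c_{j-1}^4(\gamma)}\left\{\left(\Delta_j X - h_n a_{j-1}(\alpha)\right)^2 - h_n c_{j-1}^2(\gamma)\right\},$$

and the corresponding GQMLE ([15]) by

$$\hat\theta_n := \mathop{\mathrm{argmin}}_{\theta\in\bar\Theta}\left|\left(G_n^\alpha(\theta), G_n^\gamma(\theta)\right)\right|,$$

where $\bar\Theta$ denotes the closure of $\Theta$ and $|\cdot|$ the Euclidean norm.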

We introduce additional notations associated with GQMLE. 

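• $\hat f_s := f(X_s, \hat\theta_n)$ for any function $f$ on $\mathbb{R}\times\Theta$; for notational brevity, we also use the notation $\partial_\theta \hat f_{j-1}$ instead of $\widehat{\partial_\theta f}_{j-1}$.

• $\delta_j := \dfrac{\Delta_j X - h_n a_{j-1}}{c_{j-1}}$ and $\hat\delta_j := \dfrac{\Delta_j X - h_n \hat a_{j-1}}{\hat c_{j-1}}$.

• $\hat v_n := \sqrt{nh_n}(\hat\theta_n - \theta_0)$ and $\hat w_n := \sqrt{nh_n}(\hat\gamma_n - \gamma_0)$.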


2.2. Assumptions. For our asymptotic results, we introduce some assumptions. 
Assumption 2.1 (Sampling design). nh\ —>• 0 and —>■ oo for cq € (0,1). 

Assumption 2.2 (Moments). We have $E[J_1] = 0$, $E[J_1^2] = 1$ and $E[|J_1|^q] < \infty$ for all $q > 0$.


Although we only assume the moment conditions on $J_1$, the first and the third formulae are valid for all $t > 0$ (see [18, Theorem 25.18]), and we have $E[J_t^2] = t$ from the expression of the characteristic function of $J_t$. Further, by the definition of the Lévy measure and the fact that $E[|J_t|^q]$ exists if and only if $\int_{|z|>1}|z|^q\nu_0(dz) < \infty$ (see [18, Theorem 25.3]), we see that $\int|z|^q\nu_0(dz) < \infty$ for all $q > 2$ under Assumption 2.2.


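Assumption 2.3 (Smoothness). (1) The drift coefficient $a(\cdot, \alpha_0)$ and the scale coefficient $c(\cdot, \gamma_0)$ are Lipschitz continuous.

(2) For each $i \in \{0,1,2\}$ and $k \in \{0,1,\dots,5\}$, the following conditions hold:

• The coefficients $a(x,\alpha)$ and $c(x,\gamma)$ have partial derivatives $\partial_x^i\partial_\alpha^k a(x,\alpha)$ and $\partial_x^i\partial_\gamma^k c(x,\gamma)$, and all the functions $\alpha \mapsto \partial_x^i\partial_\alpha^k a(x,\alpha)$ and $\gamma \mapsto \partial_x^i\partial_\gamma^k c(x,\gamma)$ for each $x \in \mathbb{R}$ (including $\alpha \mapsto a(x,\alpha)$ and $\gamma \mapsto c(x,\gamma)$ themselves) can be continuously extended to the boundary of $\Theta$.

• There exists a nonnegative constant $C_{(i,k)}$ satisfying

$$\sup_{(x,\alpha,\gamma)\in\mathbb{R}\times\Theta_\alpha\times\Theta_\gamma}\frac{1}{1+|x|^{C_{(i,k)}}}\left\{|\partial_x^i\partial_\alpha^k a(x,\alpha)| + |\partial_x^i\partial_\gamma^k c(x,\gamma)| + |c^{-1}(x,\gamma)|\right\} < \infty. \qquad (2.1)$$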


In this paper we will assume that $X$ is exponentially ergodic together with the boundedness of moments of any order. Let $P_t$ denote the transition probability of $X$. Given a function $\rho : \mathbb{R} \to \mathbb{R}_+$ and a signed measure $m$ on the one-dimensional Borel space, we define

$$\|m\|_\rho = \sup\{|m(f)| : f \text{ is } \mathbb{R}\text{-valued, } m\text{-measurable and satisfies } |f| \le \rho\}.$$


Assumption 2.4 (Stability). (1) There exists a probability measure $\pi_0$ such that for every $q > 0$ we can find positive constants $a$ and $c$ for which

$$\sup_{t\in\mathbb{R}_+} e^{at}\|P_t(x,\cdot) - \pi_0(\cdot)\|_g \le c\,g(x), \quad x \in \mathbb{R}, \qquad (2.2)$$

where $g(x) := 1 + |x|^q$.

(2) For all $q > 0$, we have

$$\sup_{t\in\mathbb{R}_+} E_0[|X_t|^q] < \infty.$$


The condition (2.2) corresponds to exponential ergodicity when $g$ is replaced by 1. When some boundedness conditions on the coefficients and their derivatives are assumed, the moment conditions stated above can be weakened (see [15, Section 5] for easy sufficient conditions for Assumption 2.4).

Let $G_\infty(\theta) := (G_\infty^\alpha(\theta), G_\infty^\gamma(\gamma)) \in \mathbb{R}^p$ be defined by

GZ>i^) = [ {a{x,ao) - a{x,a))7ro{dx), 

G2o(^') = ‘^ J “ c^ix,-f))TTo{dx). 

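We need to impose some conditions on $G_\infty(\theta)$ for the consistency of $\hat\alpha_n$ and $\hat\gamma_n$. A sufficient condition for the consistency of a general M- (or Z-) estimator is given in [20].

Assumption 2.5 (Identifiability). There exist nonnegative constants $\chi_\alpha$ and $\chi_\gamma$ such that

$$|G_\infty^\alpha(\theta)| \ge \chi_\alpha|\alpha - \alpha_0|, \qquad |G_\infty^\gamma(\theta)| \ge \chi_\gamma|\gamma - \gamma_0| \qquad \text{for all } \theta.$$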


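Define $\mathcal{I}(\theta_0) := \mathrm{diag}\{\mathcal{I}^\alpha(\theta_0), \mathcal{I}^\gamma(\theta_0)\} \in \mathbb{R}^p \otimes \mathbb{R}^p$ by

$$\mathcal{I}^\alpha(\theta_0) = \int \frac{(\partial_\alpha a(x,\alpha_0))^{\otimes 2}}{c^2(x,\gamma_0)}\pi_0(dx), \qquad \mathcal{I}^\gamma(\theta_0) = 4\int \frac{(\partial_\gamma c(x,\gamma_0))^{\otimes 2}}{c^2(x,\gamma_0)}\pi_0(dx),$$

where $x^{\otimes 2} := xx^\top$ for any vector or matrix $x$ and $\top$ means the transpose. The matrix $\mathcal{I}(\theta_0)$ plays the role of a Fisher-information-like quantity in GQML estimation.


Assumption 2.6 (Nondegeneracy). $\mathcal{I}^\alpha(\theta_0)$ and $\mathcal{I}^\gamma(\theta_0)$ are invertible.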


Our estimation of $\nu_0(\varphi)$ will be based on (1.2). Here we only think of Euclidean space valued $\varphi$, the treatment of complex-valued $\varphi$ being completely analogous. In our setting, we only observe the high frequency sample $(X_{h_n}, X_{2h_n}, \dots, X_{nh_n})$; hence we need to approximate $\Delta_j J$ to estimate $\nu_0(\varphi)$. Let $\mathcal{A}$ denote the formal infinitesimal generator of the Lévy process $J$, that is,

$$\mathcal{A}\varphi(x) = \int\left(\varphi(x+z) - \varphi(x) - \partial\varphi(x)z\right)\nu_0(dz), \qquad (2.3)$$

for any $\varphi$ such that the integral exists. In what follows we fix a positive integer $q$. We now define a positive constant $\rho$ fulfilling that

$$\rho > (1 - \epsilon_0) \vee \beta,$$

where $\epsilon_0$ is the same as in Assumption 2.1 and $\beta$ denotes the Blumenthal-Getoor index of $J$ defined by

$$\beta = \inf\left\{\gamma \ge 0;\ \int_{|z|\le 1}|z|^\gamma\nu_0(dz) < \infty\right\}.$$

Denote by $\mathcal{K}$ the set of all $\mathbb{R}^q$-valued functions on $\mathbb{R}$ such that its element $f = (f_k)_{k=1}^q : \mathbb{R} \to \mathbb{R}^q$ satisfies the following conditions:

(1) $f$ is five times differentiable.

(2) There exist nonnegative constants $C_i$ ($0 \le i \le 5$) such that

lim sup I-^ 1 /( 2 :)I -h 7^19/(2)11 < 00 , 

sup I p. \d^fiz)\ < 00 , iG{2,3,4,5}. 

zSR ^ \Z\ * 

We now impose 

Assumption 2.7 (Moment-fitting function). $\varphi \in \mathcal{K}$.


Then, according to the definition of the Blumenthal-Getoor index and Assumption 2.2, we have $\nu_0(\varphi) < \infty$.


3. Main results

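The Euler-Maruyama approximation says that

$$X_{t_j} \approx X_{t_{j-1}} + h_n a_{j-1} + c_{j-1}\Delta_j J.$$

This suggests that we may formally regard $\delta_j$ as an estimator of $\Delta_j J$, and indeed it will turn out to be true under our assumptions. Also, we will see that the Euler residual $\hat\delta_j$, which is constructed only from $(X_{h_n}, X_{2h_n}, \dots, X_{nh_n})$, may also serve as an estimator of $\Delta_j J$ (see the proof of Theorem 3.1). Let

$$\hat u_n := \sqrt{nh_n}\left(\frac{1}{nh_n}\sum_j \varphi(\hat\delta_j) - \nu_0(\varphi)\right), \qquad u_n := \sqrt{nh_n}\left(\frac{1}{nh_n}\sum_j \varphi(\Delta_j J) - \nu_0(\varphi)\right).$$

As was mentioned in the introduction, we know that $u_n$ is asymptotically normally distributed: $u_n \xrightarrow{\mathcal{L}} N(0, \nu_0(\varphi^{\otimes 2}))$. Let

$$\zeta(z) := z\,\partial\varphi(z).$$

The next theorem clarifies the effect of using the statistics $\hat\delta_j$ instead of the unobservable variables $\Delta_j J$.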

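Theorem 3.1. Under Assumptions 2.1-2.7, we have

$$\hat u_n = u_n + \hat b_n[\hat w_n] + o_p(1), \qquad (3.1)$$

where $\hat b_n$ is defined by

$$\hat b_n = -\left(\frac{1}{nh_n}\sum_j \zeta(\hat\delta_j)\right) \otimes \left(\frac{1}{n}\sum_j \frac{\partial_\gamma \hat c_{j-1}}{\hat c_{j-1}}\right). \qquad (3.2)$$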

Building on the stochastic expansion (3.1), we will see that substituting $\hat\delta_j$ for $\Delta_j J$ leads to a different asymptotic covariance matrix of the estimator of $\nu_0(\varphi)$. See the comments after Corollary 3.5 for more details.
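The shape of the correction term in the expansion can be motivated by a first-order Taylor argument. The following is only a heuristic sketch, not the argument of the proofs: it assumes the Euler residual has the form $\hat\delta_j = (\Delta_j X - h_n\hat a_{j-1})/\hat c_{j-1}$, writes $\zeta(z) = z\,\partial\varphi(z)$ and $\hat w_n = \sqrt{nh_n}(\hat\gamma_n - \gamma_0)$, and ignores the drift-estimation error and all remainder terms.

```latex
\begin{align*}
\hat\delta_j
  &\approx \frac{c_{j-1}}{\hat c_{j-1}}\,\Delta_j J
   \approx \Big(1 - \frac{\partial_\gamma c_{j-1}}{c_{j-1}}[\hat\gamma_n - \gamma_0]\Big)\Delta_j J, \\
\varphi(\hat\delta_j)
  &\approx \varphi(\Delta_j J)
   - \zeta(\Delta_j J)\,\frac{\partial_\gamma c_{j-1}}{c_{j-1}}[\hat\gamma_n - \gamma_0], \\
\frac{1}{\sqrt{nh_n}}\sum_j \big\{\varphi(\hat\delta_j) - \varphi(\Delta_j J)\big\}
  &\approx -\Big(\frac{1}{nh_n}\sum_j \zeta(\Delta_j J)\,
     \frac{\partial_\gamma c_{j-1}}{c_{j-1}}\Big)[\hat w_n].
\end{align*}
```

Since $\Delta_j J$ is independent of $\mathcal{F}_{t_{j-1}}$, the last average is expected to behave like the product of $\frac{1}{nh_n}\sum_j\zeta(\Delta_j J)$ and $\frac{1}{n}\sum_j \partial_\gamma c_{j-1}/c_{j-1}$, which is the tensor-product structure of the correction term.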

Remark 3.2. Although the GQMLE is adopted as the estimator of $\theta_0$, (3.1) is valid for any estimator $\hat\theta_n$ which satisfies $E[|\sqrt{nh_n}(\hat\theta_n - \theta_0)|^q] < \infty$ for all $q > 0$ (cf. the proof of Theorem 3.1).

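We define the estimating function $\mathbb{G}_n(\theta)$ for $(\theta, \nu_0(\varphi))$ by

$$\mathbb{G}_n(\theta) = \left(\frac{1}{\sqrt{nh_n}}u_n,\ G_n^\alpha(\theta),\ G_n^\gamma(\theta)\right),$$

where $G_n^\alpha(\theta)$ and $G_n^\gamma(\theta)$ are defined in the previous section. Introduce

$$\Sigma = \begin{pmatrix}\Sigma_{11} & \Sigma_{12}\\ \Sigma_{12}^\top & \Sigma_{22}\end{pmatrix}$$

with $\Sigma_{11} \in \mathbb{R}^q\otimes\mathbb{R}^q$, $\Sigma_{12} = (\Sigma_{12,kl})_{k,l} \in \mathbb{R}^q\otimes\mathbb{R}^p$ and $\Sigma_{22} = (\Sigma_{22,kl})_{k,l} \in \mathbb{R}^p\otimes\mathbb{R}^p$, where

$$\Sigma_{11} = \int\varphi(z)^{\otimes 2}\nu_0(dz),$$

$$\Sigma_{12,kl} = \begin{cases}
\displaystyle\int\varphi_k(z)z\,\nu_0(dz)\int\frac{\partial_{\alpha_l}a(x,\alpha_0)}{c(x,\gamma_0)}\pi_0(dx) & (1 \le l \le p_\alpha),\\[2ex]
\displaystyle 2\int\varphi_k(z)z^2\nu_0(dz)\int\frac{\partial_{\gamma_l}c(x,\gamma_0)}{c(x,\gamma_0)}\pi_0(dx) & (p_\alpha + 1 \le l \le p),
\end{cases}$$

$$\Sigma_{22,kl} = \begin{cases}
\displaystyle\int\frac{\partial_{\alpha_k}a(x,\alpha_0)\,\partial_{\alpha_l}a(x,\alpha_0)}{c^2(x,\gamma_0)}\pi_0(dx) & (k,l \in \{1,\dots,p_\alpha\}),\\[2ex]
\displaystyle 4\int\frac{\partial_{\gamma_k}c(x,\gamma_0)\,\partial_{\gamma_l}c(x,\gamma_0)}{c^2(x,\gamma_0)}\pi_0(dx)\int z^4\nu_0(dz) & (k,l \in \{p_\alpha+1,\dots,p\}),\\[2ex]
\displaystyle 2\int\frac{\partial_{\alpha_k}a(x,\alpha_0)\,\partial_{\gamma_l}c(x,\gamma_0)}{c^2(x,\gamma_0)}\pi_0(dx)\int z^3\nu_0(dz) & (k \in \{1,\dots,p_\alpha\},\ l \in \{p_\alpha+1,\dots,p\}).
\end{cases}$$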

Theorem 3.3. If Assumptions 2.1-2.5 and Assumption 2.7 hold, and if $\Sigma$ is positive definite, then

$$\sqrt{nh_n}\,\mathbb{G}_n(\theta_0) \xrightarrow{\mathcal{L}} N_{p+q}(0, \Sigma).$$

Remark 3.4. The moment convergence of the estimator is crucial for detecting the asymptotic behavior of statistics, which can be used, for example, in the derivation of information criteria, mean bias correction and the investigation of mean squared prediction error; see the references cited in [15]. As for the GQMLE $\hat\theta_n$, under Assumptions 2.1-2.6 we can deduce

$$E[f(\hat v_n)] \to \int_{\mathbb{R}^p} f(u)\,\phi\left(u; 0, \mathcal{I}(\theta_0)^{-1}\Sigma_{22}\mathcal{I}(\theta_0)^{-1}\right)du,$$

for every continuous function $f : \mathbb{R}^p \to \mathbb{R}$ of at most polynomial growth: see [15, Theorem 2.7]. In this paper, we do not go into details of the moment convergence of $f(\hat u_n)$.


Define the statistics f „ G Rp+9 0 Rp+’S' by 


f „ = ri 


O -de{GZ,GZ)i0n) 


where Bn = {O 6„) G R* 0 R^. We also define 


Ei2,n ^ 22 ,n 


with G R”? (gjR'?, {t,i 2 ,n,ki)k,i G R”? (gR^* and {t, 22 ,n,ki)k,i G 


where 




1 daidtj_i 


-E 


n ^ ct._i 

j \ 

1 


n ^ ct._i 
3 ^ 


\ dakdtj-idaidtj_i 

n 2 —^ ^2 


(1 < l<Pc), 


(Pa + 1 < ^ < V), 


ikfl € { 1 ,..., 


W,= .ri). 


j , .. ^ 

^ dakdtj_^d-y^Ctj_i \ / 1 y- 

71 I \ Tih.- 


E'^i ) (fc G G {pa + l,...,p}). 


It will turn out that $\hat\Sigma_n$ is a consistent estimator of the asymptotic variance $\Sigma$, which depends on the true value $(\theta_0, \nu_0(\varphi))$ under our assumptions.

By use of Theorem 3.1 and Theorem 3.3, we can derive the asymptotic normality of the statistics $(\hat u_n, \hat v_n)$ constructed only from the observed data $(X_{h_n}, X_{2h_n}, \dots, X_{nh_n})$.

Corollary 3.5. Suppose that Assumptions 2.1-2.7 hold and that $\Sigma$ is positive definite. Then $\hat\Sigma_n \xrightarrow{p} \Sigma$
and 

AAr(0,4+,), (3.3) 

where Ip denotes the p x p identity matrix. 

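By means of Lemma 5.6 and Lemma 5.9, we can observe that

$$\hat b_n \xrightarrow{p} b_0 := -\left(\int\zeta(z)\nu_0(dz)\right) \otimes \left(\int\frac{\partial_\gamma c(x,\gamma_0)}{c(x,\gamma_0)}\pi_0(dx)\right).$$

Put $B_0 := (O\ \ b_0) \in \mathbb{R}^q\otimes\mathbb{R}^p$. Under our assumptions, we can deduce that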

..ki f„Ar:=([; 

Thus it follows from (3.3) that we have the joint asymptotic normality of our estimators: 

Remark 3.6. Recall that if $J_t$ is the standard Wiener process, then the rate of $\hat\gamma_n - \gamma_0$ is $\sqrt{n}$ (see [10]). The case $\int z^4\nu_0(dz) = 0$ corresponds to this. As was noted in [15] (and as is trivial from Lemma 5.6 and Lemma 5.7), $\hat\alpha_n$ and $\hat\gamma_n$ are asymptotically orthogonal (hence asymptotically independent) if $\int z^3\nu_0(dz) = 0$. Likewise, the asymptotic independence between $\hat\theta_n = (\hat\alpha_n, \hat\gamma_n)$ and $(nh_n)^{-1}\sum_j\varphi(\hat\delta_j)$ can be easily seen from the expression of $\Sigma$: in particular, $(\hat\alpha_n, \hat\gamma_n)$ is asymptotically independent of $(nh_n)^{-1}\sum_j\varphi(\hat\delta_j)$ if both $\int\varphi_k(z)z\,\nu_0(dz)$ and $\int\varphi_k(z)z^2\nu_0(dz)$ are zero.

Now we assume that the Levy measure vq is parametrized by a parameter ^ G 0j, say where 
is a bounded convex domain in and that there exists a true value fo G 0j. The delta method then 
leads to the following corollary. 

Corollary 3.7. If the conditions of Corollary 3.5 hold and the equation F(f ip{z)v^{dz),6) = (f,0) has 
a C^-solution : K® x 0 —>■ 0j x 0 such that dF{v^,j{ip), Oq) is invertible, then 

dFn := dF(^^Y.v{5,), A dF(n^,(g.), 9o). 

\ n j / 

Moreover, we have 

_ ^ 0 , On - 0o) A Ar(o,4+,), 

where fn denotes the random vector consisting of the first q elements of F{j^ 'Yfj ^n)- 


Remark 3.8. As a matter of fact, the absence of the Wiener part in the underlying SDE model (1.1) is not essential in our results. Consider

$$dX_t = a(X_t,\alpha)dt + b(X_t,\gamma)dW_t + c(X_{t-},\gamma)dJ_t, \qquad (3.4)$$

where $W$ is an $(\mathcal{F}_t)$-adapted standard Wiener process independent of $(X_0, J)$. We first note that the results of [15] still ensure the asymptotic normality of the corresponding GQMLE of $(\alpha,\gamma)$ at rate $\sqrt{nh_n}$, in exchange for, in particular, some stringent identifiability condition on the scale parameter $\gamma$; e.g. if $b(x,\gamma) = \gamma_1$ and $c(x,\gamma) = \gamma_2$ for $\gamma = (\gamma_1,\gamma_2)$, then trivially we cannot estimate $\gamma_1$ and $\gamma_2$ separately by the naive Gaussian quasi-likelihood. Introducing an additional condition, we could deduce Theorem 3.1 with the same expression (3.2) of $\hat b_n$, except for the trivial change of the form of $\Sigma$, which stems from the necessary modification of the “one-step” variance in the construction of the Gaussian quasi-likelihood corresponding to (3.4), that is, from “$h\,c_{j-1}^2(\gamma)$” to “$h\{b_{j-1}^2(\gamma) + c_{j-1}^2(\gamma)\}$”; see Eq.(2.10) and the
expression of Ho in [15, pages 1600 and 1601] for details. More specifically, in the derivation of (3.1), 
which amounts to the stochastic expansions and estimates concerning the terms bn'^ and bn^ in the proof 
of Lemma 5.8, it turned out that the presence of the Wiener part entails an additional condition on the 
behavior of the second derivative of (p around the origin, in order to make the remainder terms in the 
Taylor expansion indeed negligible. Also to be mentioned is that the independence between W and J is 
crucial in the computation of the leading-term of (3.1); formally, in applying [7, Lemma 9] we make use 
of the calculations such as E^~^[g{AjJ)AjW] = 0 and E(~^[g‘^{AjJ){AjWfi] = hnE[g‘^{AjJ)] to obtain 
ftj-ig{AjJ)AjW = Op(l) for suitable f and g with g(0) = 0. Building on these observations, 
the proofs in case of (3.4) go through as in (1.1) without further difficulty, while the precise statement 
concerning the model (3.4) requires a series of changes of notation. We would like to omit details of the 
full picture. 


4. Numerical experiments 


Consider the following one-dimensional Levy driven SDE: 

dXt = -aXtdt - —^dJt, Xo = 0, (4.1) 

i -I- 

where the true value is $(\alpha_0, \gamma_0) = (0.5, 0.2)$; the driving noise process is the normal inverse Gaussian Lévy process such that $\mathcal{L}(J_t) = NIG(\delta, 0, \delta t, 0)$ with $\delta = 1, 5$, or $10$. It is well known that the cumulant function of $J_1$ is explicitly given by

$$\kappa(u) := \log E[\exp(iuJ_1)] = \int(\cos(uz) - 1)\nu_{0,\delta}(dz) = \delta\left(\delta - \sqrt{\delta^2 + u^2}\right), \qquad (4.2)$$

Table 1. The performance of the two-step type estimators with $\delta = 1$ and the true
value $(\alpha_0, \gamma_0, \kappa(1), \kappa(3), \kappa(5)) = (0.5, 0.2, -0.4142, -2.1623, -4.0990)$; the mean is given
with the standard deviation in parentheses.

T_n    h_n      \hat\alpha_n   \hat\gamma_n   \hat\kappa(1)_n   \hat\kappa(3)_n   \hat\kappa(5)_n
10     0.05     0.6609         0.1923         -0.4455           -2.2888           -4.0765
                (0.3912)       (0.0489)       (0.0340)          (0.5594)          (0.9300)
50     0.025    0.5394         0.1967         -0.4243           -2.1713           -4.0020
                (0.1396)       (0.0224)       (0.0346)          (0.2801)          (0.4929)
100    0.01     0.5205         0.1986         -0.4198           -2.1730           -4.0755
                (0.0994)       (0.0163)       (0.0292)          (0.2087)          (0.3746)


where $\nu_{0,\delta}$ denotes the Lévy measure of $NIG(\delta, 0, \delta, 0)$. Hence it follows that $E[J_1] = 0$, $E[J_1^2] = 1$, and that $\mathcal{L}(J_t) \to N(0,t)$ as $\delta \to \infty$.
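These facts can be read off from a Taylor expansion of the closed form of the cumulant function around $u = 0$. The following short derivation is a sketch that uses only $\kappa(u) = \delta(\delta - \sqrt{\delta^2 + u^2})$; the symbols $c_k$ for the cumulants of $J_1$ are introduced here for illustration, and the series is valid for $|u| < \delta$.

```latex
\begin{align*}
\kappa(u) &= \delta^2\Big(1 - \sqrt{1 + u^2/\delta^2}\Big)
           = -\frac{u^2}{2} + \frac{u^4}{8\delta^2} - \frac{u^6}{16\delta^4} + \cdots, \\
\kappa(u) &= \sum_{k \ge 1} c_k \frac{(iu)^k}{k!}
   \quad\Longrightarrow\quad
   c_1 = 0, \qquad c_2 = 1, \qquad c_3 = 0, \qquad c_4 = \frac{3}{\delta^2}.
\end{align*}
```

Thus the mean is zero, the variance is one, the fourth cumulant equals $3\delta^{-2}$, and $\kappa(u) \to -u^2/2$, the cumulant function of $N(0,1)$, as $\delta \to \infty$.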

In addition to $(\alpha_0, \gamma_0)$, we estimate the value of $\kappa(u)$ for some $u$, so by the symmetry of $\mathcal{L}(J_t)$ we set the moment fitting function $\varphi(x,u) = \cos(ux) - 1$. Note that the SDE model (4.1) satisfies all of our assumptions; see [15, Proposition 5.4] for the stability condition.
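The target values $\kappa(1)$, $\kappa(3)$, $\kappa(5)$ quoted in the table captions follow directly from the closed form (4.2). A minimal Python sketch of that computation is given below; the function names are chosen here for illustration only.

```python
import math


def nig_cumulant(u, delta):
    """kappa(u) = delta * (delta - sqrt(delta^2 + u^2)), cf. (4.2)."""
    return delta * (delta - math.sqrt(delta * delta + u * u))


def phi(x, u):
    """Moment-fitting function phi(x, u) = cos(u x) - 1."""
    return math.cos(u * x) - 1.0


# True values of kappa(1), kappa(3), kappa(5) for each delta in the study.
targets = {
    delta: [round(nig_cumulant(u, delta), 4) for u in (1.0, 3.0, 5.0)]
    for delta in (1.0, 5.0, 10.0)
}
for delta, values in targets.items():
    print(delta, values)
```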

Put $T_n = nh_n$. Our simulations were done for $(T_n, h_n) = (10, 0.05)$, $(50, 0.025)$ and $(100, 0.01)$ with respect to each $\delta$. We simulated 1000 independent sample paths of the true model with a sufficiently small step size by use of the Euler scheme, and the 1000 estimates $(\hat\alpha_n, \hat\gamma_n, \hat\kappa(1)_n, \hat\kappa(3)_n, \hat\kappa(5)_n)$, where

$$\hat\kappa(u)_n := \frac{1}{nh_n}\sum_{j=1}^n \varphi(\hat\delta_j, u),$$

were calculated for each sample path. For generating sample paths, we used the yuima package [2] for the R statistical environment [16]. The mean and the standard deviation of $(\hat\alpha_n, \hat\gamma_n, \hat\kappa(1)_n, \hat\kappa(3)_n, \hat\kappa(5)_n)$ were computed; these are shown in Tables 1-3.
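The two-step procedure can be sketched in a few lines of Python. This is not the code used for the tables (those were produced with yuima in R); it is an illustration under several loudly stated assumptions: the model is a hypothetical toy SDE $dX_t = -\alpha X_t dt + \gamma\,dJ_t$ with constant scale, for which the Gaussian quasi-likelihood equations have a closed-form solution; the path is generated by the Euler scheme on the observation grid itself; and NIG increments are drawn through the standard normal variance-mean mixture with an inverse Gaussian mixing variable.

```python
import math
import random


def sample_inverse_gaussian(rng, mean, shape):
    """Inverse Gaussian draw (Michael-Schucany-Haas transformation method)."""
    y = rng.gauss(0.0, 1.0) ** 2
    root = math.sqrt(4.0 * mean * shape * y + (mean * y) ** 2)
    x = max(mean + (mean * mean * y - mean * root) / (2.0 * shape), 0.0)
    return x if rng.random() <= mean / (mean + x) else mean * mean / x


def nig_increment(rng, delta, h):
    """Increment over a step h when L(J_t) = NIG(delta, 0, delta * t, 0):
    a centred normal whose variance is inverse Gaussian with mean h."""
    z = sample_inverse_gaussian(rng, h, (delta * h) ** 2)
    return math.sqrt(z) * rng.gauss(0.0, 1.0)


def simulate_path(rng, alpha, gamma, delta, h, n):
    """Euler scheme for the toy model dX = -alpha X dt + gamma dJ, X_0 = 0."""
    xs = [0.0]
    for _ in range(n):
        x = xs[-1]
        xs.append(x - alpha * x * h + gamma * nig_increment(rng, delta, h))
    return xs


def two_step(xs, h, us):
    n = len(xs) - 1
    dx = [xs[j + 1] - xs[j] for j in range(n)]
    # Step 1: Gaussian quasi-likelihood estimates (closed form for the toy model).
    alpha_hat = -sum(xs[j] * dx[j] for j in range(n)) / (
        h * sum(xs[j] ** 2 for j in range(n)))
    gamma_hat = math.sqrt(
        sum((dx[j] + h * alpha_hat * xs[j]) ** 2 for j in range(n)) / (n * h))
    # Step 2: Euler residuals, then moment fitting with phi(x, u) = cos(u x) - 1.
    resid = [(dx[j] + h * alpha_hat * xs[j]) / gamma_hat for j in range(n)]
    kappa_hat = {
        u: sum(math.cos(u * r) - 1.0 for r in resid) / (n * h) for u in us}
    return alpha_hat, gamma_hat, kappa_hat


rng = random.Random(1)
h, n = 0.01, 10_000                      # T_n = n * h = 100
xs = simulate_path(rng, alpha=0.5, gamma=0.2, delta=1.0, h=h, n=n)
alpha_hat, gamma_hat, kappa_hat = two_step(xs, h, us=(1.0, 3.0, 5.0))
print(alpha_hat, gamma_hat, kappa_hat)
```

For a general drift and scale, the first step would require a numerical root-finder or optimizer in place of the closed-form expressions; the second step is unchanged.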

From the results, we can observe the following: 

• the performance of $\hat\alpha_n$ seems to be affected not by the value of $\delta$ but by the value of $T_n$;

• the performance of $\hat\gamma_n$ seems to improve in terms of standard deviation as the value of $\delta$ increases, which can be thought to come from the fact that the asymptotic variance of $\hat\gamma_n$ tends to 0 as $\delta \to \infty$ (we have $\int z^4\nu_0(dz) = 3\delta^{-2}$);

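• the performance of $\hat\kappa(u)_n$ becomes better for smaller $u$. This is quite natural because by Theorem 3.3 the asymptotic variance of $\hat\kappa(u)_n$ is $\int\varphi(x,u)^2\nu_{0,\delta}(dx)$. From the half-angle formula $\varphi(x,u)^2 = (\cos(ux) - 1)^2 = -2\varphi(x,u) + \frac{1}{2}\varphi(x,2u)$, we have

$$\int\varphi(x,u)^2\nu_{0,\delta}(dx) = \int\left(-2\varphi(x,u) + \frac{1}{2}\varphi(x,2u)\right)\nu_{0,\delta}(dx) = -\frac{3}{2}\delta^2 + \delta\left(2\sqrt{\delta^2 + u^2} - \frac{1}{2}\sqrt{\delta^2 + 4u^2}\right) =: f(\delta,u).$$

Since $\partial_u f(\delta,u) = 2\delta\left(u/\sqrt{\delta^2 + u^2} - u/\sqrt{\delta^2 + 4u^2}\right) > 0$ for all $\delta > 0$ and $u > 0$, the asymptotic variance of $\hat\kappa(u)_n$ is increasing in $u$, clarifying the better performance of $\hat\kappa(u)_n$ for smaller values of $u$.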

In this example, it should be noted that a large value of $u$ brings about large finite-sample bias and variance of the scaled estimator $\sqrt{nh_n}\{\hat\kappa(u)_n - \kappa(u)\}$, because then both the term $\hat b_n = \hat b_n(u)$ and the $o_p(1)$ term in the right-hand side of (3.1) become large as $|u|$ increases; as seen from the proof, the latter term involves higher-order partial derivatives of $\varphi(x,u)$ with respect to $x$.
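The monotonicity of $f(\delta, u)$ in $u$ can also be checked numerically. The sketch below evaluates $f$ through the half-angle identity $f(\delta,u) = -2\kappa(u) + \frac{1}{2}\kappa(2u)$ and compares it with the closed form; the function names are illustrative and not part of the paper.

```python
import math


def nig_cumulant(u, delta):
    """kappa(u) = delta * (delta - sqrt(delta^2 + u^2))."""
    return delta * (delta - math.sqrt(delta ** 2 + u ** 2))


def asymptotic_variance(u, delta):
    """f(delta, u) = int phi(x, u)^2 nu(dx) = -2 kappa(u) + kappa(2u) / 2."""
    return -2.0 * nig_cumulant(u, delta) + 0.5 * nig_cumulant(2.0 * u, delta)


def closed_form(u, delta):
    """-(3/2) delta^2 + delta * (2 sqrt(delta^2 + u^2) - sqrt(delta^2 + 4 u^2) / 2)."""
    return -1.5 * delta ** 2 + delta * (
        2.0 * math.sqrt(delta ** 2 + u ** 2)
        - 0.5 * math.sqrt(delta ** 2 + 4.0 * u ** 2))


for delta in (1.0, 5.0, 10.0):
    values = [asymptotic_variance(u, delta) for u in (1.0, 3.0, 5.0)]
    print(delta, [round(v, 4) for v in values])
```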

5. Proofs 

Throughout our proofs, we will often omit “n” of the notation and write $E$ instead of $E_0$.

5.1. Preliminary lemmas. We begin with some lemmas. 

Lemma 5.1. Suppose that Assumption 2.1 and Assumption 2.2 hold. For all $q > 2$, it follows that

$$\frac{1}{h}E[|J_h|^q] \to \int|z|^q\nu_0(dz).$$


Table 2. The performance of the two-step type estimators with $\delta = 5$ and the true value
$(\alpha_0, \gamma_0, \kappa(1), \kappa(3), \kappa(5)) = (0.5, 0.2, -0.4951, -4.1548, -10.3553)$; the mean is given with
the standard deviation in parentheses.

T_n    h_n      \hat\alpha_n   \hat\gamma_n   \hat\kappa(1)_n   \hat\kappa(3)_n   \hat\kappa(5)_n
10     0.05     0.6762         0.1969         -0.4898           -3.7868           -8.1786
                (0.4052)       (0.0144)       (0.0029)          (0.1469)          (0.5172)
50     0.025    0.5302         0.1989         -0.4921           -3.9482           -9.1297
                (0.1421)       (0.0056)       (0.0016)          (0.0725)          (0.2480)
100    0.01     0.5160         0.1995         -0.4939           -4.0726           -9.8473
                (0.1000)       (0.0040)       (0.0010)          (0.0479)          (0.1758)


Table 3. The performance of the two-step type estimators with $\delta = 10$ and the true
value $(\alpha_0, \gamma_0, \kappa(1), \kappa(3), \kappa(5)) = (0.5, 0.2, -0.4988, -4.4031, -11.8034)$; the mean is given
with the standard deviation in parentheses.

T_n    h_n      \hat\alpha_n   \hat\gamma_n   \hat\kappa(1)_n   \hat\kappa(3)_n   \hat\kappa(5)_n
10     0.05     0.6785         0.1962         -0.4928           -3.9645           -8.9678
                (0.4130)       (0.0109)       (0.0013)          (0.0762)          (0.3412)
50     0.025    0.5250         0.1985         -0.4957           -4.1710           -10.2291
                (0.1391)       (0.0039)       (0.0004)          (0.0234)          (0.1184)
100    0.01     0.5160         0.1994         -0.4975           -4.3084           -11.1378
                (0.0972)       (0.0023)       (0.0002)          (0.0116)          (0.0649)


Proof. Under Assumption 2.2, the function $z \mapsto |z|^q$ satisfies the condition of [5, Theorem 1]. □

Remark 5.2. Although the above convergence might not be valid for all $0 < q < 2$, it holds when $q > \beta$, where $\beta$ denotes the Blumenthal-Getoor index (for details, see [5, Theorem 1], [9, Section 5.2], and [14, Theorem 1]).

From now on we simply write $f_{j-1}(\theta) = f(X_{t_{j-1}}, \theta)$, $f_{j-1} = f(X_{t_{j-1}}, \theta_0)$ and $\hat f_{j-1} = f(X_{t_{j-1}}, \hat\theta_n)$.

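Lemma 5.3. Let $f : \mathbb{R}\times\Theta_\alpha\times\Theta_\gamma \to \mathbb{R}$ be a polynomial growth function with respect to $x$, uniformly in $\alpha$ and $\gamma$. If Assumptions 2.1-2.4 are satisfied, then, for all $p \in \{1,2\}$ and $q > 0$ it follows that

$$\sup_n\sup_\theta E\left[\left|\frac{1}{nh}\sum_j f_{j-1}(\theta)\left(\Delta_j X - h a_{j-1}(\alpha)\right)^p\right|^q\right] < \infty.$$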


Moreover, we have 


sup if 


sup if 


- H-i) 


< 00 , 




< 00. 


Proof. First, we show the case of p = 1 and q >2. By the definition of A, we have 

91 

^■^/,_i(0)(A,A-/m,_i(a)) 


E 


nh 


< E 


E 


1 

nh 


Y, f j-iiO) J {as-E^-^[as])ds +E ^ ^/,_i(d) y^(if^-i[a,] - a,_i)(is 


■^/j_i(6»)(aj_i -aj-i{a)) 


■E 


nh 
We will check separately that all terms are finite. From the assumption on $f$ and Jensen’s inequality, we get

191 

- <oo- 


E 


- - «j-i(«)) 


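By Itô’s formula, we have

$$E^{j-1}[a_s] - a_{j-1} = \int_{t_{j-1}}^s E^{j-1}[\mathcal{A}a_u]du,$$

where $\mathcal{A}$ denotes the formal infinitesimal generator of $X$, namely, for $f \in C^2(\mathbb{R})$,

$$\mathcal{A}f(x) = \partial f(x)a(x) + \int\left(f(x + c(x)z) - f(x) - \partial f(x)c(x)z\right)\nu_0(dz).$$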

By [15, Lemma 4.5], the definition of A and the assumptions about coefficients and moments, for a 
V G (0,1), we get 

E^~^[Aau] < E^~^ {dxau)au + J{a{Xu + c„z) - a„ - (9a;a„)c„z)uo(dz) 

< E^~^ 1 + + J\dla{Xu + vCuz){cuzf\i^o{dz) 

< E^-^ [1 + |X„|^] < E^-^ [1 + |X„ - X,_if + \X,_if] < 1 + 

Note that we used the fact that f z'^iyo(dz) < oo for any q>2. Hence it follows that 


E 


1 

nh 


j{E^-^[as\-a,.^)ds <E j {E^-^[a,] - a,.,)d. 


< E 


< E 


-^|/,_i(0)| I I E^-^[Aa^] 


i -^b-i 


duds 




91 


< h‘^ < oo. 


Burkholder’s inequality for martingale difference array yields that 

I 91 


E 


^^/j_i(6l)y^(as - ^[as])ds 


< 


n 2 






fj{as-Ei ^[as])(is 


<n-5-iy 

i ^ 


E 


JAas-Ei i[as])(is 


2 q 


< n 2 


< T?,2 


E y ^ [|os - + \aj-i - F;l-I[as]p 9 ] ds 

j V 1 


Define the indicator function Xj by 


Xj{s) = 


< n 2 Vh < oo. 

/ 

1 s G (tj-i, tj], 


0 otherwise. 
Using this indicator function and Burkholder’s inequality, we can obtain

I 91 


E 


, ^/j-i(6') I Cs-dJs 


nh 


= E 


-| pYin _ 


< (nh)-^-^ / E 

Jo 


= {nh) 2 U[|/,_i(6l)csnds 


91 


ds 


< (nh) 2 < oo. 

Second, we look at the cases of p = 2 and q> 2. Quite similarly to the above, we have 


(5.1) 


E 




nh 


91 


< E 


EE 


91 




91 


f3-ii^)iaj-i -ai-i(a))' 


+ E 




In the same way, we get 
E 

E 


Mas] - aj-i)c?s^ 


< < oo. 


-^/j_i(6l)(aj_i - aj-i{a)f 


91 


< oo. 


Jensen’s inequality implies that 
1 


E 


nh 


55 /i-i W ( / (as - ^[a^])fis 


< E 


< E 


= E 




1 

n Jo 


nh 


< h<i\E 

nh 




-1 


■E^ 


< h^Vh < oo. 


E y |as - E^-^[as]\'^ds 

E - E^-^[as]\^X3is)ds 
E l/3-i(^')l|as - U^”Mas]Pxj(s) 

3 

\fj-i{e)\‘'\as - E^~'^[as]\^'^ds 


3 

onh 


ds 


From Ito’s formula, we get 
2 

Cs—dJg I — 2 



Cu-dJu Cs-dJs+ / / cj_z^N{ds,dz) 
3 \Jtj-i I Jj ■ 
= 2 J yJ Cu-dJuj Cs-dJs + J J cl_z‘^N{ds,dz) + J c^ds

= 2^ Cu-dj\ Cs-dJs + J cl_z‘^N{ds,dz) + J{cl - [c 5 ])(is 


+ J{E^ ^ [cl] - c]_^)ds + 


where $N(ds,dz)$ (resp. $\tilde N(ds,dz)$) is the Poisson random measure (resp. compensated Poisson random measure) associated with $J$. It follows from this decomposition together with a similar estimate to (5.1) that

. nl«-| 

yj^Cs-dJs 


E 


nh 


< oo. 


If $\theta = \theta_0$, we do not have to consider the term containing $a_{j-1} - a_{j-1}(\alpha)$. Hence Jensen’s inequality gives the desired result for all $q > 0$. □

For the sake of the asymptotic normality of we introduce the function space: 


/Cl - = (fk) ■■ 


/ is of class uo(/) < oo, sup Eq f{Js)\] = 0 {l), 

0<S<hn 


and max J J Eq [\A^ f{Js- z) - f{Js-)\‘^] J^o{dz)ds = 0 { 1 ) >. 


Lemma 5.4. If Assumption 2.1 holds and if the function $f : \mathbb{R} \to \mathbb{R}^q$ fulfills that $f(0) = \partial f(0) = 0$ and $f \in \mathcal{K}_1$, then we have

$$\frac{1}{h}E[f(J_h)] - \nu_0(f) = O(h).$$

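Proof. By the Itô-Taylor expansion, we see that

$$f(J_h) = f(0) + h\mathcal{A}f(0) + \int_0^h\int_0^s \mathcal{A}^2 f(J_u)\,du\,ds + \int_0^h\int\{f(J_{s-} + z) - f(J_{s-})\}\tilde N(ds,dz) + \int_0^h\int_0^s\int\{\mathcal{A}f(J_{u-} + z) - \mathcal{A}f(J_{u-})\}\tilde N(du,dz)\,ds.$$

Under the assumptions we see that the last two terms are martingales (see [1, Theorem 4.2.3]) and $\mathcal{A}f(0) = \nu_0(f)$, hence the result follows. □

Lemma 5.5. Suppose that Assumption 2.1, Assumption 2.2 and Assumption 2.7 hold. Then we have

$$u_n \xrightarrow{\mathcal{L}} N_q(0, \Sigma_{11}).$$

Proof. By the stationarity and independence of increments of the Lévy process $J$, we have

$$u_n = \sqrt{nh}\left(\frac{1}{nh}\sum_j\left(\varphi(\Delta_j J) - E^{j-1}[\varphi(\Delta_j J)]\right)\right) + \sqrt{nh}\left(\frac{1}{nh}\sum_j E^{j-1}[\varphi(\Delta_j J)] - \nu_0(\varphi)\right) = e_n + f_n,$$

where

$$e_n = \sqrt{nh}\left(\frac{1}{nh}\sum_j\left(\varphi(\Delta_j J) - E[\varphi(J_h)]\right)\right) =: \sum_j e_j, \qquad f_n = \sqrt{nh}\left(\frac{1}{h}E[\varphi(J_h)] - \nu_0(\varphi)\right).$$

By the previous lemma, it is clear that $f_n = o(1)$ under Assumption 2.1. We will prove that $e_n \xrightarrow{\mathcal{L}} N_q(0, \Sigma_{11})$ by applying the martingale central limit theorem [3]. First, we show that $\varphi_k, \varphi_k\varphi_l \in \mathcal{K}_1$ where $\varphi_k$ denotes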





fcth component of Lp (in the case of g = 1). We only prove Lp & K-i] the other case is similar. By the 
definition of A (see (2.3)) and Taylor’s theorem, for fixed s £ (0,1), we have 

£1 [|^^(/3(Js) |] < sup E / Alp{Js uz)z^VQ{dz) 

«G[ 0 , 1 ] L J 

Recall that by Assumption 2.2 and the definition of Levy measure, we have / 121 ^ 1 ^ 0 (^ 2 ) < 00 for all 
q > 2. By means of Assumption 2.7 and dominated convergence theorem it follows that 

\d‘^Ap){x)\= d'^ {(p{x + z) - (p{x) - d(p{x)z)i^o{dz)^ ^ 1 + 


for all a; G M. Hence we have 


Similarly, we can show that 


sup E [\A'^ip{Js)\] < 00 . 

0<.S<.hn 


J j E [\‘p{Js- + z) - (p{Js-)\'^] vo{dz)ds < 00 , 

J J E \\A‘p{Js- + z) — Ap{Js-)\^\ izo{dz)ds < 00 . 

Obviously, we have E[ej] = 0. The properties of conditional expectation yield that 

“ E[‘^k{Jh)p^l{Jh)] ~ E[ipk{Jh)]E[pi{Jh)], 

Lemma 5.4 leads to ^ / ‘fk{z)ipi{z)izo{dz). From Assumption 2.2, Assumption 2.7 

and Lemma 5.1, we obtain A[|ejj,|^] = 0{h). Hence we have E[\ej\^] —>■ 0, namely Lindeberg 

condition holds. Combining these discussion, we deduce that e„ —> A/)j(0, Sn) as was to be shown, 
completing the proof. □ 


Define Gn{d) G R*’ by 


G„(0) = (G“(0),G^(0)), 


and it is easy to calculate its derivative deGAd) = ( 

\t>aGAd) c)^Gj^[u) 


as follows: 






Cj-lil) 


-(AjX - haj-i{a)) - 


A-iil) 


d'yGnid) = 9aaj-i[d^TC^\i-f)]{AjX - huj-iia)), 

j 

daGlid) = - ^(AjA - haj-i{a))[d^cJ^^i-f)]daTaj-iia), 


d,GZ(0) = -T E [-af c-_\( 7)](A,A - H_i(a))2 ^ 2h 


c?-i(7) 


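Lemma 5.6. Under Assumptions 2.1-2.5, it follows that

$$\sup_{\theta\in\Theta} |G_n(\theta) - G_\infty(\theta)| \xrightarrow{P_0} 0,$$

$$\sqrt{nh}\, G_n(\theta_0) \xrightarrow{\mathcal{L}} N_p(0, \Sigma_{22}).$$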

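Proof. For simplicity, we suppose that $p_\alpha = p_\gamma = 1$; the high-dimensional case is similar. First, we will show the $\theta$-pointwise convergence

$$|G_n(\theta) - G_\infty(\theta)| \xrightarrow{P_0} 0.$$

From [7, Lemma 9], it suffices to show that

$$\frac{1}{nh}\sum_j E^{j-1}\bigl[M_{j-1}(\theta)(\Delta_j X - h a_{j-1}(\alpha))\bigr] \xrightarrow{P_0} G_\infty^\alpha(\theta),$$

$$\frac{1}{nh}\sum_j E^{j-1}\Bigl[[-\partial_\gamma c_{j-1}^{-2}(\gamma)](\Delta_j X - h a_{j-1}(\alpha))^2 - h\frac{\partial_\gamma c_{j-1}^2(\gamma)}{c_{j-1}^2(\gamma)}\Bigr] \xrightarrow{P_0} G_\infty^\gamma(\theta),$$

$$\frac{1}{(nh)^2}\sum_j E^{j-1}\bigl[|M_{j-1}(\theta)(\Delta_j X - h a_{j-1}(\alpha))|^2\bigr] \xrightarrow{P_0} 0,$$

$$\frac{1}{(nh)^2}\sum_j E^{j-1}\Bigl[\Bigl|[-\partial_\gamma c_{j-1}^{-2}(\gamma)](\Delta_j X - h a_{j-1}(\alpha))^2 - h\frac{\partial_\gamma c_{j-1}^2(\gamma)}{c_{j-1}^2(\gamma)}\Bigr|^2\Bigr] \xrightarrow{P_0} 0.$$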


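By the definition of $X$, we observe that

$$\Delta_j X - h a_{j-1}(\alpha) = \int_j (a_s - a_{j-1})ds + \int_j c_{s-}dJ_s + h(a_{j-1} - a_{j-1}(\alpha)).$$

Hence the martingale property of $J$ implies that

$$E^{j-1}\bigl[M_{j-1}(\theta)(\Delta_j X - h a_{j-1}(\alpha))\bigr] = M_{j-1}(\theta)\Bigl\{ h(a_{j-1} - a_{j-1}(\alpha)) + E^{j-1}\Bigl[\int_j (a_s - a_{j-1})ds\Bigr]\Bigr\}.$$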


Now, from [15, Lemma 4.5] and $\sup_\theta |M(x, \theta)| \lesssim 1 + |x|^C$ for some $C > 0$, we have

3 ^ ^ ^3 \ 


E 


E 

E3-i 

1- 

1 

1 

1_ 

2" 



IJ 3 J 

_ 


j E[\as - aj-i\^]ds 


so the ergodic theorem gives 

1 . 
nh ■ 

Similarly, we see that 



E^- 


- 1970 ^^ 1 ( 7 )] {^jX - haj-i{a)Y - h 


2 , 


cLi(7) 




(og — aj-i)ds + Ji'^s- — Cj_i)fiJg + h{aj-i — aj-i{a)) + cj-iAjJ^ 


hd^c'j_ 


Uii) 




cf-i(7) ’ 


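where $\zeta_{j-1} := \int_j (a_s - a_{j-1})ds + \int_j (c_{s-} - c_{j-1})dJ_s + h(a_{j-1} - a_{j-1}(\alpha))$. Applying [15, Lemma 4.5] and Burkholder's inequality, we see that

$$E^{j-1}\bigl[|\zeta_{j-1}|^2\bigr] \lesssim h\int_j E^{j-1}[|X_s - X_{j-1}|^2]ds + \int_j E^{j-1}[|X_s - X_{j-1}|^2]ds + h^2(a_{j-1} - a_{j-1}(\alpha))^2 \lesssim h^2(1 + |X_{j-1}|^C),$$

and $E^{j-1}[c_{j-1}^2(\Delta_j J)^2] = h c_{j-1}^2$. Hence we have $|E^{j-1}[\zeta_{j-1} c_{j-1}\Delta_j J]| \lesssim h^{3/2}(1 + |X_{j-1}|^C)$ by the conditional Cauchy-Schwarz inequality. It follows that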


r 

2' 


r 

2' 



/ (ttg - aj-i)ds 


+ E^-^ 

1 (Cs— Cj — i^dJs 


+ E^-^ 

\h{aj-i 

J 3 



3 





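$$\frac{1}{nh}\sum_j E^{j-1}\Bigl[[-\partial_\gamma c_{j-1}^{-2}(\gamma)](\Delta_j X - h a_{j-1}(\alpha))^2 - h\frac{\partial_\gamma c_{j-1}^2(\gamma)}{c_{j-1}^2(\gamma)}\Bigr] \xrightarrow{P_0} G_\infty^\gamma(\theta),$$

from the ergodic theorem. The above calculation yields that

$$E^{j-1}\bigl[|\Delta_j X - h a_{j-1}(\alpha)|^2\bigr] \lesssim h(1 + |X_{j-1}|^C),$$

and obviously, this inequality remains valid when we replace 2 with any $q \ge 2$. In the same way we can easily see that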


E^- 


{XjX - haj-i{a))‘^ - h 


57^1-1 ( 7 ) 


^i(7) 


< 


- 970 -^ 1 ( 7 )!" E^-^[{AjX - haj-i{a))^] + 




Uil) 




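so the ergodic theorem gives

$$\frac{1}{(nh)^2}\sum_j E^{j-1}\bigl[|M_{j-1}(\theta)(\Delta_j X - h a_{j-1}(\alpha))|^2\bigr] \xrightarrow{P_0} 0,$$

$$\frac{1}{(nh)^2}\sum_j E^{j-1}\Bigl[\Bigl|[-\partial_\gamma c_{j-1}^{-2}(\gamma)](\Delta_j X - h a_{j-1}(\alpha))^2 - h\frac{\partial_\gamma c_{j-1}^2(\gamma)}{c_{j-1}^2(\gamma)}\Bigr|^2\Bigr] \xrightarrow{P_0} 0.$$

As a result of these computations, we obtain the $\theta$-pointwise convergence

$$|G_n(\theta) - G_\infty(\theta)| \xrightarrow{P_0} 0.$$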


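To prove the uniformity of (5.2), it suffices to show the tightness, which is in turn implied by

$$\sup_n E\Bigl[\sup_\theta |\partial_\theta G_n(\theta)|\Bigr] < \infty.$$

In the case of $p_\alpha = p_\gamma = q = 1$, we have

$$\partial_\alpha G_n^\alpha(\theta) = \frac{1}{nh}\sum_j\Bigl\{\frac{\partial_\alpha^2 a_{j-1}(\alpha)}{c_{j-1}^2(\gamma)}(\Delta_j X - h a_{j-1}(\alpha)) - h\frac{(\partial_\alpha a_{j-1}(\alpha))^2}{c_{j-1}^2(\gamma)}\Bigr\},$$

$$\partial_\gamma G_n^\alpha(\theta) = -\frac{2}{nh}\sum_j \frac{\partial_\alpha a_{j-1}(\alpha)\,\partial_\gamma c_{j-1}(\gamma)}{c_{j-1}^3(\gamma)}(\Delta_j X - h a_{j-1}(\alpha)),$$

$$\partial_\gamma G_n^\gamma(\theta) = \frac{1}{nh}\sum_j\Bigl\{ 2\,\frac{c_{j-1}(\gamma)\,\partial_\gamma^2 c_{j-1}(\gamma) - 3(\partial_\gamma c_{j-1}(\gamma))^2}{c_{j-1}^4(\gamma)}(\Delta_j X - h a_{j-1}(\alpha))^2 - 2h\,\frac{\partial_\gamma^2 c_{j-1}(\gamma)\,c_{j-1}(\gamma) - (\partial_\gamma c_{j-1}(\gamma))^2}{c_{j-1}^2(\gamma)}\Bigr\}, \tag{5.2}$$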

and if we impose some regularity conditions on $a$ and $c$, we can readily calculate the higher-order derivatives of $G_n(\theta)$. Sobolev's inequality and Lemma 5.3 imply that for $q > p$,

$$E\Bigl[\sup_\theta |\partial_\theta G_n(\theta)|^q\Bigr] \lesssim \sup_\theta E\bigl[|\partial_\theta G_n(\theta)|^q + |\partial_\theta^2 G_n(\theta)|^q\bigr] < \infty.$$

Hence we are able to conclude that $[\{G_n(\theta) - G_\infty(\theta)\}_{\theta\in\Theta}]_{n\in\mathbb{N}}$ is uniformly tight (see, e.g. [11]) so that the continuous mapping theorem yields that $\sup_{\theta\in\Theta}|G_n(\theta) - G_\infty(\theta)| \xrightarrow{P_0} 0$. Moreover, the consistency of $\hat\theta$ immediately follows from [20, Theorem 5.3]. We will observe that

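$$\sqrt{nh}\,G_n^\alpha(\theta_0) = \frac{1}{\sqrt{nh}}\sum_j \frac{\partial_\alpha a_{j-1}}{c_{j-1}}\Delta_j J + o_p(1),$$

$$\sqrt{nh}\,G_n^\gamma(\theta_0) = \frac{2}{\sqrt{nh}}\sum_j \frac{\partial_\gamma c_{j-1}}{c_{j-1}}\bigl((\Delta_j J)^2 - h\bigr) + o_p(1).$$

A trivial decomposition leads to

$$\sqrt{nh}\,G_n^\alpha(\theta_0) = \frac{1}{\sqrt{nh}}\sum_j M_{j-1}(\Delta_j X - h a_{j-1}) = \frac{1}{\sqrt{nh}}\sum_j M_{j-1}\int_j (a_s - a_{j-1})ds + \frac{1}{\sqrt{nh}}\sum_j M_{j-1}\int_j (c_{s-} - c_{j-1})dJ_s + \frac{1}{\sqrt{nh}}\sum_j \frac{\partial_\alpha a_{j-1}}{c_{j-1}}\Delta_j J.$$

From this, it suffices to show that $\frac{1}{\sqrt{nh}}\sum_j M_{j-1}\int_j (a_s - a_{j-1})ds$ and $\frac{1}{\sqrt{nh}}\sum_j M_{j-1}\int_j (c_{s-} - c_{j-1})dJ_s$ are $o_p(1)$. Notice that $|M_{j-1}| \lesssim 1 + |X_{j-1}|^C$. As in the proof of Lemma 5.3, we can observe that these terms are $o_p(1)$. Hence we get

$$\sqrt{nh}\,G_n^\alpha(\theta_0) = \frac{1}{\sqrt{nh}}\sum_j \frac{\partial_\alpha a_{j-1}}{c_{j-1}}\Delta_j J + o_p(1).$$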


It is clear that 


V^GZ{9o) = 7= E I (A, 


X — haj-iY ~ 




Vnh 


'y ^ (ij—i)ds i^Cs— Cj—i)dJ^ 


+ y '^i-iAj*/ (^J(ij-i)ds + J{cs- Cj-i)dJ^ 




^((A, J)" - ft)). 

Cj-I 


By Assumption 2.3, admits a polynomial majorant, so it follows that 


E 


\/nh 


y ^ cij—i')ds A 


= o(l), 


from Lemma 5.3. Similar calculations yield that 
1 


E 


< 


\/nh 


1 


\/nh 


y ^ iAjT (^Cts G.J—l)ds A l)^'^s^ 

(us CLj—i^ds A ^j—i')dJs 


E^ 


|9..yCj_l Aj J| 


< 


Vnh ' 


^/A[Aj J]^ ( \ ^ y^(as - aj_i)ds \ Jics--Cj-i)dJ. 


2" 





+ \ 

E 

/ 

_ 

\ 




In the last inequality, we used the independence of increments of $J$. By Lemma 5.1, we observe that $\frac{1}{h}E[J_h^2] \to 1$, so we see that


1 


Vnh 




1 


r 

2' 


r 


E 

/ {as - aj-i)ds 


+ \ 

E 

v\ 


? 3 

. 

\ 

L 


J' {^Cs— Cj — i^dJs 


< 


— Vh{h^ + h) < Vnh? = o(l). 
Vnh 


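Hence we get

$$\sqrt{nh}\,G_n^\gamma(\theta_0) = \frac{2}{\sqrt{nh}}\sum_j \frac{\partial_\gamma c_{j-1}}{c_{j-1}}\bigl((\Delta_j J)^2 - h\bigr) + o_p(1).$$

We define

$$\sqrt{nh}\,\tilde{G}_n^\alpha(\theta_0) = \frac{1}{\sqrt{nh}}\sum_j \frac{\partial_\alpha a_{j-1}}{c_{j-1}}\Delta_j J,$$

$$\sqrt{nh}\,\tilde{G}_n^\gamma(\gamma_0) = \frac{2}{\sqrt{nh}}\sum_j \frac{\partial_\gamma c_{j-1}}{c_{j-1}}\bigl((\Delta_j J)^2 - h\bigr).$$

From Assumption 2.2, we have

$$E^{j-1}\Bigl[\frac{\partial_\alpha a_{j-1}}{c_{j-1}}\Delta_j J\Bigr] = 0, \qquad E^{j-1}\Bigl[\frac{\partial_\gamma c_{j-1}}{c_{j-1}}\bigl((\Delta_j J)^2 - h\bigr)\Bigr] = 0.$$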


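The ergodic theorem and Lemma 5.1 give

$$\frac{1}{(nh)^2}\sum_j E^{j-1}\Bigl[\Bigl|\frac{\partial_\alpha a_{j-1}}{c_{j-1}}\Delta_j J\Bigr|^4\Bigr] = \frac{1}{nh}\cdot\frac{1}{n}\sum_j \Bigl|\frac{\partial_\alpha a_{j-1}}{c_{j-1}}\Bigr|^4\cdot\frac{1}{h}E[J_h^4] = o_p(1),$$

$$\frac{1}{(nh)^2}\sum_j E^{j-1}\Bigl[\Bigl|\frac{\partial_\gamma c_{j-1}}{c_{j-1}}\bigl((\Delta_j J)^2 - h\bigr)\Bigr|^4\Bigr] = \frac{1}{nh}\cdot\frac{1}{n}\sum_j \Bigl|\frac{\partial_\gamma c_{j-1}}{c_{j-1}}\Bigr|^4\cdot\frac{1}{h}E\bigl[(J_h^2 - h)^4\bigr] = o_p(1),$$

so the Lindeberg condition holds. Furthermore we get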




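$$\frac{1}{nh}\sum_j E^{j-1}\Bigl[\frac{\partial_{\alpha_k} a_{j-1}\,\partial_{\alpha_l} a_{j-1}}{c_{j-1}^2}(\Delta_j J)^2\Bigr] = \frac{1}{n}\sum_j \frac{\partial_{\alpha_k} a_{j-1}\,\partial_{\alpha_l} a_{j-1}}{c_{j-1}^2}\cdot\frac{1}{h}E[J_h^2] = \frac{1}{n}\sum_j \frac{\partial_{\alpha_k} a_{j-1}\,\partial_{\alpha_l} a_{j-1}}{c_{j-1}^2},$$

$$\frac{1}{nh}\sum_j E^{j-1}\Bigl[\frac{\partial_{\gamma_k} c_{j-1}\,\partial_{\gamma_l} c_{j-1}}{c_{j-1}^2}\bigl((\Delta_j J)^2 - h\bigr)^2\Bigr] = \frac{1}{n}\sum_j \frac{\partial_{\gamma_k} c_{j-1}\,\partial_{\gamma_l} c_{j-1}}{c_{j-1}^2}\cdot\frac{1}{h}\bigl\{E[J_h^4] + o(h)\bigr\},$$

$$\frac{1}{nh}\sum_j E^{j-1}\Bigl[\frac{\partial_{\alpha_k} a_{j-1}\,\partial_{\gamma_l} c_{j-1}}{c_{j-1}^2}\Delta_j J\bigl((\Delta_j J)^2 - h\bigr)\Bigr] = \frac{1}{n}\sum_j \frac{\partial_{\alpha_k} a_{j-1}\,\partial_{\gamma_l} c_{j-1}}{c_{j-1}^2}\cdot\frac{1}{h}E[J_h^3].$$

Finally, we apply the ergodic theorem to derive

$$\frac{1}{n}\sum_j \frac{\partial_{\alpha_k} a_{j-1}\,\partial_{\alpha_l} a_{j-1}}{c_{j-1}^2} \xrightarrow{P_0} \int \frac{\partial_{\alpha_k} a(x,\alpha_0)\,\partial_{\alpha_l} a(x,\alpha_0)}{c^2(x,\gamma_0)}\pi_0(dx),$$

$$\frac{1}{nh}\sum_j E^{j-1}\Bigl[\frac{\partial_{\gamma_k} c_{j-1}\,\partial_{\gamma_l} c_{j-1}}{c_{j-1}^2}\bigl((\Delta_j J)^2 - h\bigr)^2\Bigr] \xrightarrow{P_0} \int \frac{\partial_{\gamma_k} c(x,\gamma_0)\,\partial_{\gamma_l} c(x,\gamma_0)}{c^2(x,\gamma_0)}\pi_0(dx)\int z^4\nu_0(dz),$$

$$\frac{1}{nh}\sum_j E^{j-1}\Bigl[\frac{\partial_{\alpha_k} a_{j-1}\,\partial_{\gamma_l} c_{j-1}}{c_{j-1}^2}\Delta_j J\bigl((\Delta_j J)^2 - h\bigr)\Bigr] \xrightarrow{P_0} \int \frac{\partial_{\alpha_k} a(x,\alpha_0)\,\partial_{\gamma_l} c(x,\gamma_0)}{c^2(x,\gamma_0)}\pi_0(dx)\int z^3\nu_0(dz),$$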

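with which the martingale central limit theorem completes the proof. □

Applying Taylor's theorem to $G_n(\theta_0)$, we get

$$\sqrt{nh}\,G_n(\theta_0) = -\int_0^1 \partial_\theta G_n\bigl(\hat\theta + u(\theta_0 - \hat\theta)\bigr)du\,\sqrt{nh}(\hat\theta - \theta_0).$$

Note that by the consistency of $\hat\alpha$ and $\hat\gamma$, we can consider $G_n(\hat\theta) = 0$ a.s., for large enough $n$.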
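For intuition about the estimating equation $G_n(\hat\theta) = 0$, consider the hypothetical special case $a(x,\alpha) = -\alpha x$ and $c(x,\gamma) = \gamma > 0$, chosen only for illustration. Assuming the usual Gaussian quasi-likelihood estimating functions (drift residual weighted by $\partial_\alpha a / c^2$, squared residual centred by $h c^2$), the equations solve in closed form: $\hat\alpha = -\sum_j X_{j-1}\Delta_j X / (h\sum_j X_{j-1}^2)$ and $\hat\gamma^2 = (nh)^{-1}\sum_j (\Delta_j X + h\hat\alpha X_{j-1})^2$. The sketch below simulates an Euler recursion (not the exact SDE solution) driven by a compound Poisson process normalized to unit variance and evaluates these formulas. All names and parameter values are assumptions of this example.

```python
import math
import random


def poisson(rng, mean):
    """Poisson(mean) draw via Knuth's multiplication method (fine for small mean)."""
    limit = math.exp(-mean)
    count, prod = 0, rng.random()
    while prod > limit:
        count += 1
        prod *= rng.random()
    return count


def simulate_euler_path(n, h, alpha, gamma, lam, seed=0):
    """Euler recursion X_j = X_{j-1} - alpha * X_{j-1} * h + gamma * (increment of J)."""
    rng = random.Random(seed)
    jump_sd = math.sqrt(1.0 / lam)  # makes Var(J_1) = 1
    x = [0.0]
    for _ in range(n):
        dj = sum(rng.gauss(0.0, jump_sd) for _ in range(poisson(rng, lam * h)))
        x.append(x[-1] - alpha * x[-1] * h + gamma * dj)
    return x


def gqmle_linear(x, h):
    """Closed-form root of the estimating equations for a(x) = -alpha x, c(x) = gamma."""
    n = len(x) - 1
    sxx = sum(x[j - 1] ** 2 for j in range(1, n + 1))
    sxd = sum(x[j - 1] * (x[j] - x[j - 1]) for j in range(1, n + 1))
    alpha_hat = -sxd / (h * sxx)
    rss = sum((x[j] - x[j - 1] + h * alpha_hat * x[j - 1]) ** 2 for j in range(1, n + 1))
    gamma_hat = math.sqrt(rss / (n * h))
    return alpha_hat, gamma_hat
```

With a long observation window, e.g. `gqmle_linear(simulate_euler_path(100000, 0.02, 1.0, 2.0, 5.0), 0.02)`, both estimates should be close to the values used in the simulation, with fluctuations of order $(nh)^{-1/2}$.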
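Lemma 5.7. If Assumptions 2.1-2.6 hold, we have

$$\sup_{|\theta| < \epsilon_n} \bigl|-\partial_\theta G_n(\theta_0 + \theta) - \mathcal{I}(\theta_0)\bigr| \xrightarrow{P_0} 0, \quad \text{where } \epsilon_n \to 0,$$

$$\sqrt{nh}(\hat\theta - \theta_0) \xrightarrow{\mathcal{L}} N\bigl(0, \mathcal{I}(\theta_0)^{-1}\Sigma_{22}(\mathcal{I}(\theta_0)^{-1})^\top\bigr).$$

Proof. We may set $p_\alpha = p_\gamma = 1$. Define the $2 \times 2$-valued matrix $\mathcal{I}(\theta)$ such that

$$\mathcal{I}(\theta) = \begin{pmatrix} \mathcal{I}^{(\alpha\alpha)}(\theta) & \mathcal{I}^{(\alpha\gamma)}(\theta) \\ 0 & \mathcal{I}^{(\gamma\gamma)}(\theta) \end{pmatrix},$$

where $\mathcal{I}^{(\alpha\alpha)}(\theta)$, $\mathcal{I}^{(\alpha\gamma)}(\theta)$ and $\mathcal{I}^{(\gamma\gamma)}(\theta)$ are defined by

$$\mathcal{I}^{(\alpha\alpha)}(\theta) = \int \Bigl\{ \frac{\partial_\alpha^2 a(x,\alpha)}{c^2(x,\gamma)}\bigl(a(x,\alpha) - a(x,\alpha_0)\bigr) + \frac{(\partial_\alpha a(x,\alpha))^2}{c^2(x,\gamma)} \Bigr\}\pi_0(dx),$$

$$\mathcal{I}^{(\alpha\gamma)}(\theta) = 2\int \frac{\partial_\alpha a(x,\alpha)\,\partial_\gamma c(x,\gamma)}{c^3(x,\gamma)}\bigl(a(x,\alpha_0) - a(x,\alpha)\bigr)\pi_0(dx),$$

$$\mathcal{I}^{(\gamma\gamma)}(\theta) = \int \Bigl\{ [\partial_\gamma^2 c^{-2}(x,\gamma)]\,c^2(x,\gamma_0) + 2\,\frac{\partial_\gamma^2 c(x,\gamma)\,c(x,\gamma) - (\partial_\gamma c(x,\gamma))^2}{c^2(x,\gamma)} \Bigr\}\pi_0(dx).$$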

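As in the previous lemma, we can prove

$$-\partial_\theta G_n(\theta) \xrightarrow{P_0} \mathcal{I}(\theta), \quad \text{for all } \theta.$$

By Assumption 2.3, it immediately follows that for all $k \in \{1,2,3,4\}$, $\partial_\theta^k G_n(\theta)$ can be decomposed as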


1 


k) 


d^G^e) = - ha,_i(a)) + 

3 

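where the coefficient functions are functions of $X_{t_{j-1}}$ of at most polynomial growth uniformly in $\theta$. Hence Sobolev's inequality implies that $\{-\partial_\theta G_n(\theta_0 + \cdot) - \mathcal{I}(\theta_0)\}_{n\in\mathbb{N}}$ is uniformly tight and the continuous mapping theorem gives

$$\sup_{|\theta| < \epsilon_n} \bigl|-\partial_\theta G_n(\theta_0 + \theta) - \mathcal{I}(\theta_0)\bigr| \xrightarrow{P_0} 0, \quad \text{where } \epsilon_n \to 0.$$

Further, the continuity of $\mathcal{I}(\theta)$ and the consistency of $\hat\theta$ give

$$-\int_0^1 \partial_\theta G_n\bigl(\hat\theta + u(\theta_0 - \hat\theta)\bigr)du \xrightarrow{P_0} \mathcal{I}(\theta_0).$$

Assumption 2.6 ensures that $\lim_{n\to\infty} P_0\bigl(-\int_0^1 \partial_\theta G_n(\hat\theta + u(\theta_0 - \hat\theta))du > 0\bigr) = 1$, hence we can suppose that $-\int_0^1 \partial_\theta G_n(\hat\theta + u(\theta_0 - \hat\theta))du$ is invertible for all $n$ large enough. Hence, applying Slutsky's lemma, we have the desired result. □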


Obviously, it follows from Lemma 5.7 that $-\partial_\theta G_n(\hat\theta)$ can serve as a consistent estimator of $\mathcal{I}(\theta_0)$. In the same way, we could provide a consistent estimator of the asymptotic variance of $\hat\theta$, making it possible to construct a confidence region.

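We introduce the following function space:

$$\mathcal{K}_2 = \Bigl\{ f = (f_k) \;\Big|\; f \text{ is of class } C^2,\ \frac{1}{h}\max_{1\le j\le n} E\bigl[|\partial f(\delta_j)|^2\bigr] = O(1),\ \frac{1}{h}\max_{1\le j\le n}\sup_{u\in[0,1]} E\bigl[|\partial f(\Delta_j J + u(\delta_j - \Delta_j J))|^2\bigr] = O(1),$$

$$\text{and } \forall K > 0,\ \max_{1\le j\le n}\sup_{u\in[0,1]} E\bigl[|\partial^2 f(\delta_j + u(\hat\delta_j - \delta_j))|^K\bigr] = O(1) \Bigr\}.$$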

By use of this class we can prove:

Lemma 5.8. Suppose that Assumptions 2.1-2.5 hold and that $\varphi \in \mathcal{K}_2$. Then we have the stochastic expansion:

‘p{5j) - r'o(¥’) j ® d-f{c~\ j)cj-iAjJ[wn] + Op(l), 

^ j ' 3 

where we also have ® dryic~3i))cj-iAjJ = Op{l). 

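Proof. We first split off the two error terms

$$b_n^{(1)} := \frac{1}{\sqrt{nh}}\sum_{j=1}^n \bigl(\varphi(\delta_j) - \varphi(\Delta_j J)\bigr), \qquad b_n^{(2)} := \frac{1}{\sqrt{nh}}\sum_{j=1}^n \bigl(\varphi(\hat\delta_j) - \varphi(\delta_j)\bigr),$$

so that $\frac{1}{\sqrt{nh}}\sum_{j}(\varphi(\hat\delta_j) - \varphi(\Delta_j J)) = b_n^{(1)} + b_n^{(2)}$.

Let us first prove $b_n^{(1)} = o_p(1)$. Applying Taylor's theorem, we see that

$$b_n^{(1)} = \frac{1}{\sqrt{nh}}\sum_j \int_0^1 \partial\varphi\bigl(\Delta_j J + u(\delta_j - \Delta_j J)\bigr)du\,(\delta_j - \Delta_j J).$$

By definition of $\delta_j$, it follows that

$$\Delta_j J - \delta_j = c_{j-1}^{-1}\bigl(c_{j-1}\Delta_j J - \Delta_j X + h a_{j-1}\bigr) = -c_{j-1}^{-1}\Bigl(\int_j (a_s - a_{j-1})ds + \int_j (c_{s-} - c_{j-1})dJ_s\Bigr).$$

As in the proof of Lemma 5.6, we have

$$E\bigl[|\Delta_j J - \delta_j|^q\bigr] \lesssim h^2$$

for all $q \ge 2$. Applying the Cauchy-Schwarz inequality we get

$$E\bigl[|b_n^{(1)}|\bigr] \le \frac{1}{\sqrt{nh}}\sum_j E\Bigl[\Bigl|\int_0^1 \partial\varphi\bigl(\Delta_j J + u(\delta_j - \Delta_j J)\bigr)du\Bigr|\,|\Delta_j J - \delta_j|\Bigr]$$

$$\le \frac{1}{\sqrt{nh}}\sum_j \sqrt{E\Bigl[\Bigl|\int_0^1 \partial\varphi\bigl(\Delta_j J + u(\delta_j - \Delta_j J)\bigr)du\Bigr|^2\Bigr]}\,\sqrt{E\bigl[|\Delta_j J - \delta_j|^2\bigr]}$$

$$\le \sqrt{n}\max_{1\le j\le n}\Bigl\{\sqrt{\frac{1}{h}\sup_{u\in[0,1]} E\bigl[|\partial\varphi(\Delta_j J + u(\delta_j - \Delta_j J))|^2\bigr]}\,\sqrt{E\bigl[|\Delta_j J - \delta_j|^2\bigr]}\Bigr\}$$

$$\lesssim \sqrt{nh^2} = o(1).$$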

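Next we turn to $b_n^{(2)}$. By Taylor's theorem, we have

$$b_n^{(2)} = \frac{1}{\sqrt{nh}}\sum_j \partial\varphi(\delta_j)(\hat\delta_j - \delta_j) + \frac{1}{\sqrt{nh}}\sum_j \int_0^1\!\!\int_0^1 v\,\partial^2\varphi\bigl(\delta_j + uv(\hat\delta_j - \delta_j)\bigr)dv\,du\,(\hat\delta_j - \delta_j)^2 =: b_n^{(2,1)} + b_n^{(2,2)}.$$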

For notational convenience, we denote by $R(x)$ a generic matrix-valued function defined on $\mathbb{R} \times \Theta$ for which there exists a constant $C > 0$ such that $\sup_\theta |R(x, \theta)| \le C(1 + |x|^C)$ for every $x$; the argument $\theta$ is omitted from the notation, and the specific form of $R_{j-1}$ appearing below may vary from line to line. From the definition of $\hat\delta_j$ and $\delta_j$,

$$\hat\delta_j - \delta_j = \hat c_{j-1}^{-1}(\Delta_j X - h\hat a_{j-1}) - c_{j-1}^{-1}(\Delta_j X - h a_{j-1}) = (\hat c_{j-1}^{-1} - c_{j-1}^{-1})\Delta_j X - h(\hat\eta_{j-1} - \eta_{j-1}). \tag{5.4}$$

Again applying Taylor’s theorem, we obtain 

\^3 - ^ A |57c 7-\(7)|) kHAjAp fsup \d9T]j_i{9)\\ |hp 


< — {\u,\^\A,X\^ + h^\v\^)\R,.,\ 

<L^\A,X\^ + h^)\R,.,\\v\^. (5.5) 

A similar argument to the proof of Lemma 5.3 gives the estimate $E[|R_{j-1}|E^{j-1}[|\Delta_j X|^q]] \lesssim h$ for all $q \ge 2$. By means of these estimates and Hölder's inequality we can deduce that, for sufficiently large $p > 2$ and sufficiently small $q > 1$,

$$\frac{1}{\sqrt{nh}}\sum_j \Bigl|\int_0^1\!\!\int_0^1 v\,\partial^2\varphi\bigl(\delta_j + uv(\hat\delta_j - \delta_j)\bigr)dv\,du\Bigr|\,|\hat\delta_j - \delta_j|^2$$

$\lesssim$


J_±| 

'/nh nh 




vd^ip{Sj + uv{Sj — Sj))dvdu 


0 -'0 




1 l,.,o 1 


< >- — \v 


\^nh h 


E 


1 rl 


vd‘^ip{6j + uv{5j — 5j))dvdu 


0 ^0 


uj -T uvyuj uj/ 

-I 


-^{|i?,_i|(|A,X|2 + /,2)}A 


< 


< 


VnhJ^ 

As for bn’^\ we first observe that 
1 


X 0,(1) E j X Op(l) + Op{h^) 




[ [ vdf'^{cj}^){'yo + uv{'j--fo))dvdu 

Jo Jo 


Uo Jo 

In a similar way to the estimate of \bn’‘^'^\, it follows from the definition of IC 2 , the tightness of (w), and 
Cauchy-Schwarz’s inequality that 


{nh)~^'^d(p{6j)AjXw'^ / / vdf'^{cj\){'yo + uv{^ - jo))dvdu 

j LJo Jo 

< (nh)-^ Y. X Opil) 

3 

1/1 \ ^^^ / 1 \ 1/2 

(^i:iA,xpifi,_.i) .0,0) 


W 


<Or 


^Vnh^ 


.(I)- 


We also have 

fh 


- Vj-i) ^ E f dgrjj-iieo + u{e - eo))di 

\ Tl . Tl . Jq 


— Op(l). 


We thus get 


=:/r„[w]+ Op(l). (5.6) 


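It remains to take a closer look at $M_n$. Substitute the expression

$$\Delta_j X = \int_j a_s ds + \int_j (c_{s-} - c_{j-1})dJ_s + c_{j-1}\Delta_j J$$

into (5.6) and observe that

$$\Bigl|\frac{1}{nh}\sum_j \int_j a_s ds\,\bigl(\partial\varphi(\delta_j) \otimes \partial_\gamma(c_{j-1}^{-1})\bigr)\Bigr| \lesssim \Bigl(\frac{1}{nh}\sum_j |\partial\varphi(\delta_j)|^2\Bigr)^{1/2}\Bigl(\frac{1}{nh}\sum_j |R_{j-1}|\Bigl(\int_j |a_s| ds\Bigr)^2\Bigr)^{1/2} \le O_p(\sqrt{h}),$$

and similarly that, by using Burkholder's inequality (conditional on $\mathcal{F}_{t_{j-1}}$),

$$\Bigl|\frac{1}{nh}\sum_j \int_j (c_{s-} - c_{j-1})dJ_s\,\bigl(\partial\varphi(\delta_j) \otimes \partial_\gamma(c_{j-1}^{-1})\bigr)\Bigr| \lesssim \Bigl(\frac{1}{nh}\sum_j |\partial\varphi(\delta_j)|^2\Bigr)^{1/2}\Bigl(\frac{1}{nh}\sum_j |R_{j-1}|\Bigl|\int_j (c_{s-} - c_{j-1})dJ_s\Bigr|^2\Bigr)^{1/2} \le O_p(\sqrt{h}).$$

Therefore $M_n = \frac{1}{nh}\sum_j \bigl(\partial\varphi(\delta_j) \otimes \partial_\gamma(c_{j-1}^{-1})\bigr)c_{j-1}\Delta_j J + o_p(1)$ and we also get

$$E\Bigl[\Bigl|\frac{1}{nh}\sum_j \bigl(\partial\varphi(\delta_j) \otimes \partial_\gamma(c_{j-1}^{-1})\bigr)c_{j-1}\Delta_j J\Bigr|\Bigr] \lesssim \Bigl(\frac{1}{n}\sum_j \frac{1}{h}E\bigl[|\partial\varphi(\delta_j)|^2\bigr]\Bigr)^{1/2}\Bigl(\frac{1}{n}\sum_j \frac{1}{h}E\bigl[|R_{j-1}|^2\bigr]E\bigl[|\Delta_j J|^2\bigr]\Bigr)^{1/2} = O(1),$$

hence the proof is complete. □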




C 

E 

^3 - ^3 



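5.2. Proof of Theorem 3.1. In order to obtain (3.1), we first show that actually $\varphi \in \mathcal{K}_2$ and $\zeta \in \mathcal{K}_1 \cap \mathcal{K}_2$ (recall the notation $\zeta(z) = z\partial\varphi(z)$). As in the proof of Lemma 5.5, it follows that $\zeta \in \mathcal{K}_1$. From the proof of Lemma 5.1 and Lemma 5.8, for all $q \ge 2$, we have

$$\max_{1\le j\le n} E\bigl[|\Delta_j J - \delta_j|^q\bigr] = O(h^2), \qquad \max_{1\le j\le n} E\bigl[|\Delta_j J|^q\bigr] = O(h).$$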

Moreover, [15, Theorem 2.7] and (5.5) give 

< {nh)-^E [(|AjA|<^ + h^) \Rj_i\\vf] = O ((n/i)-^, 

for any a G (0,1). Hence the Chebyshev’s inequality yields that 

max sup \p (|Aj J + u{5j — AjJ)\ > M) V P (jJj + u{5j — Jj)] > m\ | = 0{h). 

We will use these estimates without notice below. By the condition on dp}, we have 

\dip{AjJ + u{Sj - AjJ))\^ 

1 |A, J|2 + \6, - A,jf + |A, J|2(i+c^i) + \6, - A, J|2(i+c^i) I = 0{h). 


sup E 

tiSlO.l] 


< E 


In the same way as above, we also obtain E 
exists a constant C > 2 such that 

. . . iK 


d(p{5j) 


= 0(h). By Assumption 2.7, for all AT > 0, there 


d ipiSj+u{Sj - 6j)) 


< 1 + 
< 1 + 


^3 ^3 


^3 ^3 


c 


+ |A/J-^/| + |Aj Jj'^, 


so it is straightforward that 


max sup Eq 

l</<"ug[0.1] 

< 1 + max Eq 

1< j<n 


d (p{6j + u{6j - 6j)) 

c 


K' 


^3 ^3 


+ |A,J-<5,r+ |A,jr 


= 0 ( 1 ). 


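Hence $\varphi \in \mathcal{K}_2$; similarly $\zeta \in \mathcal{K}_2$.

Now we have $\delta_j - \Delta_j J = c_{j-1}^{-1}\int_j (a_s - a_{j-1})ds + c_{j-1}^{-1}\int_j (c_{s-} - c_{j-1})dJ_s$; then, $E[E^{j-1}[|\delta_j - \Delta_j J|^2]] \lesssim h^2$. Plugging in the expression $\partial\varphi(\delta_j) = \partial\varphi(\Delta_j J) + (\delta_j - \Delta_j J)\int_0^1 \partial^2\varphi(\Delta_j J + u(\delta_j - \Delta_j J))du$ and then applying analogous estimates under Assumption 2.7 as before, we can deduce that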

— — AjJ) J d (p{AjJ + u{6j — AjJ))du 0 cj-iAjJ 


1/2 




1 

— Y / d^(p{AjJ + u{Sj — AjJ))di 
^ Jo 


1/2 


\Rj-i\^\A,J\^^ 


1/2 


< 0,(1) X - ^ |A, Jp(l + |A, J - 5,f + \A,jf) = 0,(1). 


It follows from [5, Theorem 1] and Lemma 5.8 that under the present assumptions about C we have 
}jE[C{AjJ)] = MO + o(l) = Ej 00) + OpO)- Therefore, 

{OMJ) ® 9^icJ-i)} Cj-I + OpO) 


+ ^ J2{)0MJ) - EKMjJ)]) ® Cj-i + 0 ,( 1 ) 


= -iA[C(A,J)]®(l^ 


1 ^^ d^Cj—\ 


n ^^ Co_i 

3 


OpO) 




n 

3 ' ^ 3 


1 O-yCj — l 


Cj — i 


Op0)j 


where we used the martingale central limit theorem together with Burkholder’s inequality for the third 
equality. Thus the proof is complete. 
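The second step of the procedure, the moment estimator built from Euler-type residuals, can be sketched in a few lines of Python. Here `drift` and `scale` stand for the fitted coefficients $x \mapsto a(x, \hat\alpha)$ and $x \mapsto c(x, \hat\gamma)$ obtained in the first step; these callables, the function names and the small hand-made path in the usage note are illustrative assumptions rather than the authors' implementation.

```python
def euler_residuals(x, h, drift, scale):
    """Residuals (x_j - x_{j-1} - h * drift(x_{j-1})) / scale(x_{j-1}) for j = 1..n."""
    return [
        (x[j] - x[j - 1] - h * drift(x[j - 1])) / scale(x[j - 1])
        for j in range(1, len(x))
    ]


def moment_estimate(x, h, drift, scale, phi):
    """Plug-in estimate (1 / (n h)) * sum_j phi(residual_j) of the integral of phi
    with respect to the Levy measure."""
    residuals = euler_residuals(x, h, drift, scale)
    n = len(residuals)
    return sum(phi(r) for r in residuals) / (n * h)
```

As a small check of the arithmetic, the path `[0.0, 2.0, 1.0, 1.5]` with `h = 0.5`, `drift = lambda v: -v` and `scale = lambda v: 2.0` has residuals `1.0`, `0.0` and `0.5`, so `phi = lambda z: z * z` yields `1.25 / 1.5`.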

5.3. Proof of Theorem 3.3. From Lemmas 5.5, 5.6 and 5.7, it suffices to show that 


n h 


{(pk{AjJ) - E[(pk{AjJ)]) 


d< 

Cj-i 

Cj-i 


(pk{z)zMJO / ^^EEiE7rp(dx), 


c(a;,7o) 


((A,J)2-/i) 


Po. 


(Pkiz)z‘^MJO 


d^,c{x,M 

cO,lo) 


7ro(da 


Assumption 2.2 yields that 


^ai ^j — 1 
Cd—i 


AM 


^Ctl Ojj — l 


Cl — 1 




Similarly, we have 

{iPk{AjJ)-E[MMJ)]) 


^11^3 

Cj-i 


((AM)^-h) 


Cj-i 


{E[MJh)Jl]- hEMiJh)]} 


From the proof of Lemma 5.5 we can readily observe that $z\varphi, z^2\varphi \in \mathcal{K}_1$. Hence the ergodic theorem and Lemma 5.4 lead to the desired result.


5.4. Proof of Corollary 3.5. For the construction of asymptotic variance, we define the function space:

$$\mathcal{K}_3 = \Bigl\{ f = (f_k) \;\Big|\; f \text{ is of class } C^1,\ \max_{1\le j\le n}\sup_{u\in[0,1]} E\bigl[|\partial f(\Delta_j J + u(\delta_j - \Delta_j J))|^2\bigr] = o(1),$$

$$\text{and } \frac{1}{nh^2}\max_{1\le j\le n}\sup_{u\in[0,1]} E\bigl[|\partial f(\delta_j + u(\hat\delta_j - \delta_j))|^2\bigr] = O(1) \Bigr\}.$$

The following lemma gives sufficient conditions for a given function to belong to $\mathcal{K}_3$.

Lemma 5.9. Assume that an $\mathbb{R}^q$-valued or $\mathbb{R}^q \otimes \mathbb{R}^q$-valued function $f$ is differentiable and that there exists a nonnegative constant $D$ such that $\limsup_{z\to 0} |z|^{-(1-\epsilon_0)}|\partial f(z)| < \infty$ and $\limsup_{|z|\to\infty} (1 + |z|^D)^{-1}|\partial f(z)| < \infty$, where $\epsilon_0$ is given in Assumption 2.1. Then $f \in \mathcal{K}_3$.

Proof. Dividing the events and applying Holder’s inequality, we have 
1 


max sup E 
nh^ i<i<n„g(o,i) 


\dfiSj +u{6j -Sj)\^ 


= —max sup E 
nh i<j<n„g(o,i) 


\df{Sj + u{6j - Sj))\^; \6j + u{Sj - 4)| < 1 


——r max sup E 
nh^ i<i<n„g(o,i) 


\df{Sj + u{Sj - ( 5 j))p; \Sj + u{Sj - ,5j)| > 1 


1 


2 (l-eo) 


2 (l-eo)' 

^ —TT max E 

nh'^ i<j<n 

S 3 

+ 

Sj ~ Sj 



—r max ( E 
nm i<i<n 


1 + 


2D 


^3 ^3 


(p ( 


S 3 - Sj 


> 1 


l-£0 


° nhi+<’o 

The other condition can be verified as well. 


□ 


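First we note that, under Assumption 2.7, the mappings $z \mapsto \varphi(z)$, $z\varphi(z)$, $z^2\varphi(z)$ satisfy the conditions of Lemma 5.4 and Lemma 5.9. Let us show that for any $f \in \mathcal{K}_1 \cap \mathcal{K}_3$ we have

$$\frac{1}{nh}\sum_{j=1}^n f(\hat\delta_j) \xrightarrow{P_0} \nu_0(f). \tag{5.7}$$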


From a similar decomposition to that in the proof of Theorem 3.1, we have 




nh 


nh 


/(A, J) - Mf) \ ■ 


nh 


Then Lemma 5.4 implies the last term is $o_p(1)$. Taylor's expansion and Hölder's inequality yield that


3 

1 1 

Vnh'nh 


^ ^ ^ f'(5j + m (( 5 j - 5j))du{5j - 5j) 


< 


E 


f'{5, +u{Sj - Sj))di 


sup 1 9c( 7 ) I |AjA 


1 1 V 

'/nh n 


f{Sj +u{ 6 j-Sj))du 


sup|r 7 j_i( 6 »)| ) \v\ 
e 


< 


\ (nhy 


■E 


f'{Sj +u{6j - 5j))di 


^i;E7P|0T.(7)l’AAI" =< 0,(1)+ 


Hence, using the conditioning argument together with iii[| Aj Ap] < hwe obtain ^ fiSj)~/h ^3 
Op(l). Recall that E[\AjJ — 5/^] /, hf, from which 


nh 


< 


^ “ E ■f'(^ 3 J + u{Sj - AjJ))du X 


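hence (5.7) follows.

Now, (5.7) and Lemma 5.9 yield that $\hat\Sigma_{11,n} \xrightarrow{P_0} \Sigma_{11}$. As in the proof of Lemma 5.6, it follows that

$$\sup_{\theta\in\Theta}\Bigl|\frac{1}{n}\sum_{j=1}^n \frac{\partial_{\alpha_k} a_{j-1}(\alpha)\,\partial_{\alpha_l} a_{j-1}(\alpha)}{c_{j-1}^2(\gamma)} - \int \frac{\partial_{\alpha_k} a(x,\alpha)\,\partial_{\alpha_l} a(x,\alpha)}{c^2(x,\gamma)}\pi_0(dx)\Bigr| \xrightarrow{P_0} 0.$$

Hence the consistency of $\hat\theta$ and the continuity of the map $\theta \mapsto \int \frac{\partial_{\alpha_k} a(x,\alpha)\,\partial_{\alpha_l} a(x,\alpha)}{c^2(x,\gamma)}\pi_0(dx)$ imply that

$$\frac{1}{n}\sum_{j=1}^n \frac{\partial_{\alpha_k} \hat a_{j-1}\,\partial_{\alpha_l} \hat a_{j-1}}{\hat c_{j-1}^2} \xrightarrow{P_0} \int \frac{\partial_{\alpha_k} a(x,\alpha_0)\,\partial_{\alpha_l} a(x,\alpha_0)}{c^2(x,\gamma_0)}\pi_0(dx).$$

Similar estimates and Slutsky's lemma lead to $\hat\Sigma_{12,n} \xrightarrow{P_0} \Sigma_{12}$ and $\hat\Sigma_{22,n} \xrightarrow{P_0} \Sigma_{22}$. The desired result follows from Theorem 3.1, Theorem 3.3 and Slutsky's lemma.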

5.5. Proof of Corollary 3.7. From the result of Theorem 3.1, $\frac{1}{nh}\sum_{j=1}^n \varphi(\hat\delta_j) - \nu_0(\varphi) = o_p(1)$. Hence the continuity of $\partial F$ and the invertibility of $\partial F(\nu_0(\varphi), \theta_0)$ yield the first result. Finally, [20, Theorem 3.1] leads to the second result.